COMPOUNDS FOR IMMUNOTHERAPY AND DIAGNOSIS
OF BREAST CANCER AND METHODS FOR THEIR USE

TECHNICAL FIELD 

The present invention relates generally to compositions and methods for the 
treatment and diagnosis of breast cancer. The invention is more particularly related to
polypeptides comprising at least a portion of a protein that is preferentially expressed in 
breast tumor tissue and to polynucleotide molecules encoding such polypeptides. Such 
polypeptides may be used in vaccines and pharmaceutical compositions for treatment of 
breast cancer. Additionally, such polypeptides and polynucleotides may be used in the
immunodiagnosis of breast cancer.

BACKGROUND OF THE INVENTION 

Breast cancer is a significant health problem for women in the United States 
and throughout the world. Although advances have been made in detection and treatment of 
the disease, breast cancer remains the second leading cause of cancer-related deaths in 
women, affecting more than 180,000 women in the United States each year. For women in 
North America, the lifetime odds of getting breast cancer are now one in eight.

No vaccine or other universally successful method for the prevention or
treatment of breast cancer is currently available. Management of the disease currently relies
on a combination of early diagnosis (through routine breast screening procedures) and
aggressive treatment, which may include one or more of a variety of treatments such as
surgery, radiotherapy, chemotherapy and hormone therapy. The course of treatment for a
particular breast cancer is often selected based on a variety of prognostic parameters,
including an analysis of specific tumor markers. See, e.g., Porter-Jordan and Lippman,
Breast Cancer 8:73-100 (1994). However, the use of established markers often leads to a
result that is difficult to interpret, and the high mortality observed in breast cancer patients 
indicates that improvements are needed in the treatment, diagnosis and prevention of the
disease.

Accordingly, there is a need in the art for improved methods for therapy and
diagnosis of breast cancer. The present invention fulfills these needs and further provides
other related advantages.

SUMMARY OF THE INVENTION 

The present invention provides compounds and methods for immunotherapy
of breast cancer. In one aspect, isolated polypeptides are provided comprising at least an
immunogenic portion of a breast tumor protein or a variant of said protein that differs only in
conservative substitutions and/or modifications, wherein the breast tumor protein comprises
an amino acid sequence encoded by a polynucleotide molecule having a partial sequence 
selected from the group consisting of (a) nucleotide sequences recited in SEQ ID NOS: 3, 10, 
17, 24, 45-52 and 55-67, 72, 73, and 89-94, (b) complements of said nucleotide sequences 
and (c) sequences that hybridize to a sequence of (a) or (b) under moderately stringent 
conditions. 

In related aspects, isolated polynucleotide molecules encoding the above 
polypeptides are provided. In specific embodiments, such polynucleotide molecules have
partial sequences provided in SEQ ID NOS: 3, 10, 17, 24, 45-52 and 55-67, 72, 73, and
89-94. The present invention further provides expression vectors comprising the above
polynucleotide molecules and host cells transformed or transfected with such expression
vectors. In preferred embodiments, the host cells are selected from the group consisting of
E. coli, yeast and mammalian cells.

In another aspect, the present invention provides fusion proteins comprising a
first and a second inventive polypeptide or, alternatively, an inventive polypeptide and a 
known breast antigen. 

The present invention also provides pharmaceutical compositions comprising 
at least one of the above polypeptides, or a polynucleotide molecule encoding such a 
polypeptide, and a physiologically acceptable carrier, together with vaccines comprising at 
least one or more such polypeptide or polynucleotide molecule in combination with a non- 
specific immune response enhancer. Pharmaceutical compositions and vaccines comprising 
one or more of the above fusion proteins are also provided.

In related aspects, pharmaceutical compositions for the treatment of breast
cancer comprising at least one polypeptide and a physiologically acceptable carrier are
provided, wherein the polypeptide comprises an immunogenic portion of a breast tumor
protein or a variant thereof, the breast tumor protein being encoded by a polynucleotide
molecule having a partial sequence selected from the group consisting of: (a) nucleotide
sequences recited in SEQ ID NOS: 1, 2, 4-9, 11-16, 18-23, 25-44, 53, 54, 68-71, and 74-88,
(b) complements of said nucleotide sequences, and (c) sequences that hybridize to a sequence
of (a) or (b) under moderately stringent conditions. The invention also provides vaccines for
the treatment of breast cancer comprising such polypeptides in combination with a
non-specific immune response enhancer, together with pharmaceutical compositions and vaccines
comprising at least one polynucleotide molecule having a partial sequence provided in SEQ
ID NOS: 1, 2, 4-9, 11-16, 18-23, 25-44, 53, 54, 68-71, and 74-88.

In yet another aspect, methods are provided for inhibiting the development of
breast cancer in a patient, comprising administering an effective amount of at least one of the
above pharmaceutical compositions and/or vaccines.

The present invention also provides methods for immunodiagnosis of breast 
cancer, together with kits for use in such methods. In one specific aspect of the present
invention, methods are provided for detecting breast cancer in a patient, comprising: (a)
contacting a biological sample obtained from a patient with a binding agent that is capable of
binding to one of the inventive polypeptides; and (b) detecting in the sample a protein or
polypeptide that binds to the binding agent. In preferred embodiments, the binding agent is
an antibody, most preferably a monoclonal antibody. 

In related aspects, methods are provided for monitoring the progression of 
breast cancer in a patient, comprising: (a) contacting a biological sample obtained from a 
patient with a binding agent that is capable of binding to one of the above polypeptides; (b) 
determining in the sample an amount of a protein or polypeptide that binds to the binding 
agent; (c) repeating steps (a) and (b); and comparing the amounts of polypeptide detected in 
steps (b) and (c). 

Within related aspects, the present invention provides antibodies, preferably 
monoclonal antibodies, that bind to the inventive polypeptides, as well as diagnostic kits
comprising such antibodies, and methods of using such antibodies to inhibit the development
of breast cancer.

The present invention further provides methods for detecting breast cancer 
comprising: (a) obtaining a biological sample from a patient; (b) contacting the sample with
a first and a second oligonucleotide primer in a polymerase chain reaction, at least one of the
oligonucleotide primers being specific for a DNA molecule that encodes one of the above
polypeptides; and (c) detecting in the sample a DNA sequence that amplifies in the presence
of the first and second oligonucleotide primers. In a preferred embodiment, at least one of the
oligonucleotide primers comprises at least about 10 contiguous nucleotides of a DNA
molecule having a partial sequence selected from the group consisting of SEQ ID NOS: 1-94.

In a further aspect, the present invention provides a method for detecting 
breast cancer in a patient comprising: (a) obtaining a biological sample from the patient; (b) 
contacting the sample with an oligonucleotide probe specific for a polynucleotide molecule 
that encodes one of the above polypeptides; and (c) detecting in the sample a polynucleotide
sequence that hybridizes to the oligonucleotide probe. Preferably, the oligonucleotide probe
comprises at least about 15 contiguous nucleotides of a DNA molecule having a partial
sequence selected from the group consisting of SEQ ID NOS: 1-94.

In related aspects, diagnostic kits comprising the above oligonucleotide probes
or primers are provided. 
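
The length criteria stated above for primers (at least about 10 contiguous nucleotides) and probes (at least about 15) can be checked mechanically. The following is a minimal sketch only, assuming plain uppercase or lowercase A/C/G/T strings; the function names and the idea of also testing the reverse complement strand are illustrative assumptions and are not part of the disclosure.

```python
# Hypothetical helper names; sequences are assumed to contain only A, C, G, T.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def longest_shared_run(oligo: str, target: str) -> int:
    """Length of the longest stretch of `oligo` that occurs contiguously in `target`."""
    oligo, target = oligo.upper(), target.upper()
    best = 0
    for start in range(len(oligo)):
        # Only try to beat the best run found so far.
        length = best + 1
        while start + length <= len(oligo) and oligo[start:start + length] in target:
            best = length
            length += 1
    return best


def has_contiguous_match(oligo: str, target: str, min_len: int) -> bool:
    """True if `oligo` shares at least `min_len` contiguous nucleotides with
    `target` or with its reverse complement (assumption: either strand counts)."""
    forward = longest_shared_run(oligo, target)
    reverse = longest_shared_run(oligo, reverse_complement(target))
    return max(forward, reverse) >= min_len
```

Under these assumptions, a candidate primer would be tested with `min_len=10` and a candidate probe with `min_len=15`. The word "about" in the text is not modeled here.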

These and other aspects of the present invention will become apparent upon 
reference to the following detailed description. All references disclosed herein are hereby
incorporated by reference in their entirety as if each was incorporated individually. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figs. 1A and B show the specific lytic activity of a first and a second B511S-specific
CTL clone, respectively, measured on autologous LCL transduced with B511S (filled
squares) or HLA-A3 (open squares).

DETAILED DESCRIPTION OF THE INVENTION

As noted above, the present invention is generally directed to compositions 
and methods for the immunotherapy and diagnosis of breast cancer. The inventive 
compositions are generally isolated polypeptides that comprise at least a portion of a breast 
tumor protein. Also included within the present invention are molecules (such as an antibody
or fragment thereof) that bind to the inventive polypeptides. Such molecules are referred to 
herein as "binding agents." 

In particular, the subject invention discloses polypeptides comprising at least a
portion of a human breast tumor protein, or a variant thereof, wherein the breast tumor protein
includes an amino acid sequence encoded by a polynucleotide molecule including a
sequence selected from the group consisting of: nucleotide sequences recited in SEQ ID
NOS: 1-94, the complements of said nucleotide sequences, and variants thereof. As used
herein, the term "polypeptide" encompasses amino acid chains of any length, including full
length proteins, wherein the amino acid residues are linked by covalent peptide bonds. Thus,
a polypeptide comprising a portion of one of the above breast proteins may consist entirely of
the portion, or the portion may be present within a larger polypeptide that contains additional
sequences. The additional sequences may be derived from the native protein or may be
heterologous, and such sequences may be immunoreactive and/or antigenic.

As used herein, an "immunogenic portion" of a human breast tumor protein is 
a portion that is capable of eliciting an immune response in a patient afflicted with breast
cancer and as such binds to antibodies present within sera from a breast cancer patient. Such
immunogenic portions generally comprise at least about 5 amino acid residues, more
preferably at least about 10, and most preferably at least about 20 amino acid residues. 
Immunogenic portions of the proteins described herein may be identified in antibody binding 
assays. Such assays may generally be performed using any of a variety of means known to 
those of ordinary skill in the art, as described, for example, in Harlow and Lane, Antibodies:
A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1988. For
example, a polypeptide may be immobilized on a solid support (as described below) and
contacted with patient sera to allow binding of antibodies within the sera to the immobilized
polypeptide. Unbound sera may then be removed and bound antibodies detected using, for
example, 125I-labeled Protein A. Alternatively, a polypeptide may be used to generate
monoclonal and polyclonal antibodies for use in detection of the polypeptide in blood or
other fluids of breast cancer patients. Methods for preparing and identifying immunogenic
portions of antigens of known sequence are well known in the art and include those
summarized in Paul, Fundamental Immunology, 3rd ed., Raven Press, 1993, pp. 243-247.

The term "polynucleotide(s)," as used herein, means a single or double- 
stranded polymer of deoxyribonucleotide or ribonucleotide bases and includes DNA and 
corresponding RNA molecules, including HnRNA and mRNA molecules, both sense and 
anti-sense strands, and comprehends cDNA, genomic DNA and recombinant DNA, as well as 
wholly or partially synthesized polynucleotides. An HnRNA molecule contains introns and 
corresponds to a DNA molecule in a generally one-to-one manner. An mRNA molecule
corresponds to an HnRNA and DNA molecule from which the introns have been excised. A
polynucleotide may consist of an entire gene, or any portion thereof. Operable anti-sense
polynucleotides may comprise a fragment of the corresponding polynucleotide, and the
definition of "polynucleotide" therefore includes all such operable anti-sense fragments. 

The compositions and methods of the present invention also encompass 
variants of the above polypeptides and polynucleotides. A polypeptide "variant," as used 
herein, is a polypeptide that differs from the recited polypeptide only in conservative 
substitutions and/or modifications, such that the therapeutic, antigenic and/or immunogenic
properties of the polypeptide are retained. In a preferred embodiment, variant polypeptides 
differ from an identified sequence by substitution, deletion or addition of five amino acids or 
fewer. Such variants may generally be identified by modifying one of the above polypeptide 
sequences, and evaluating the antigenic properties of the modified polypeptide using, for 
example, the representative procedures described herein. Polypeptide variants preferably 
exhibit at least about 70%, more preferably at least about 90% and most preferably at least 
about 95% identity (determined as described below) to the identified polypeptides. 

As used herein, a "conservative substitution" is one in which an amino acid is 
substituted for another amino acid that has similar properties, such that one skilled in the art 
of peptide chemistry would expect the secondary structure and hydropathic nature of the 
polypeptide to be substantially unchanged. In general, the following groups of amino acids
represent conservative changes: (1) ala, pro, gly, glu, asp, gln, asn, ser, thr; (2) cys, ser, tyr,
thr; (3) val, ile, leu, met, ala, phe; (4) lys, arg, his; and (5) phe, tyr, trp, his.
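
The five groups above lend themselves to a simple lookup. The sketch below is illustrative only: it assumes three-letter residue codes, treats a substitution as conservative when both residues appear together in at least one listed group, and uses hypothetical names (`CONSERVATIVE_GROUPS`, `is_conservative`) that do not come from the disclosure.

```python
# The five groups of amino acids listed in the text (three-letter codes).
# A residue may belong to more than one group.
CONSERVATIVE_GROUPS = [
    {"ala", "pro", "gly", "glu", "asp", "gln", "asn", "ser", "thr"},
    {"cys", "ser", "tyr", "thr"},
    {"val", "ile", "leu", "met", "ala", "phe"},
    {"lys", "arg", "his"},
    {"phe", "tyr", "trp", "his"},
]


def is_conservative(original: str, replacement: str) -> bool:
    """Return True if the two residues share at least one listed group.

    Assumption: an unchanged residue is reported as conservative.
    """
    a, b = original.lower(), replacement.lower()
    if a == b:
        return True
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)
```

For instance, under this reading a lys-to-arg change falls within group (4), whereas leu and asp share no group. Whether a given substitution actually preserves secondary structure and hydropathic nature would still need to be evaluated as described in the text.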

Variants may also, or alternatively, contain other modifications, including the 
deletion or addition of amino acids that have minimal influence on the antigenic properties, 
secondary structure and hydropathic nature of the polypeptide. For example, a polypeptide 
may be conjugated to a signal (or leader) sequence at the N-terminal end of the protein which 
co-translationally or post-translationally directs transfer of the protein. The polypeptide may 
also be conjugated to a linker or other sequence for ease of synthesis, purification or 
identification of the polypeptide (e.g., poly-His), or to enhance binding of the polypeptide to a
solid support. For example, a polypeptide may be conjugated to an immunoglobulin Fc 
region. 

A nucleotide "variant" is a sequence that differs from the recited nucleotide
sequence in having one or more nucleotide deletions, substitutions or additions. Such
modifications may be readily introduced using standard mutagenesis techniques, such as
oligonucleotide-directed site-specific mutagenesis as taught, for example, by Adelman et al.
(DNA, 2:183, 1983). Nucleotide variants may be naturally occurring allelic variants, or
non-naturally occurring variants. Variant nucleotide sequences preferably exhibit at least about
70%, more preferably at least about 80% and most preferably at least about 90% identity
(determined as described below) to the recited sequence.

The antigens provided by the present invention include variants that are 
encoded by DNA sequences which are substantially homologous to one or more of the DNA 
sequences specifically recited herein. "Substantial homology," as used herein, refers to DNA 
sequences that are capable of hybridizing under moderately stringent conditions. Suitable 
moderately stringent conditions include prewashing in a solution of 5X SSC, 0.5% SDS, 
1.0 mM EDTA (pH 8.0); hybridizing at 50°C-65°C, 5X SSC, overnight or, in the event of
cross-species homology, at 45°C with 0.5X SSC; followed by washing twice at 65°C for 20
minutes with each of 2X, 0.5X and 0.2X SSC containing 0.1% SDS. Such hybridizing DNA
sequences are also within the scope of this invention, as are nucleotide sequences that, due to
code degeneracy, encode an immunogenic polypeptide that is encoded by a hybridizing DNA
sequence.

Two nucleotide or polypeptide sequences are said to be "identical" if the
sequence of nucleotides or amino acid residues in the two sequences is the same when aligned
for maximum correspondence as described below. Comparisons between two sequences are
typically performed by comparing the sequences over a comparison window to identify and 
compare local regions of sequence similarity. A "comparison window," as used herein, refers
to a segment of at least about 20 contiguous positions, usually 30 to about 75, more
preferably 40 to about 50, in which a sequence may be compared to a reference sequence of 
the same number of contiguous positions after the two sequences are optimally aligned. 

Optimal alignment of sequences for comparison may be conducted using the 
Megalign program in the Lasergene suite of bioinformatics software (DNASTAR, Inc.,
Madison, WI), using default parameters. This program embodies several alignment schemes
described in the following references: Dayhoff, M.O. (1978) A model of evolutionary change
in proteins - Matrices for detecting distant relationships. In Dayhoff, M.O. (ed.) Atlas of
Protein Sequence and Structure, National Biomedical Research Foundation, Washington DC
Vol. 5, Suppl. 3, pp. 345-358; Hein J. (1990) Unified Approach to Alignment and Phylogenies
pp. 626-645 Methods in Enzymology vol. 183, Academic Press, Inc., San Diego, CA;
Higgins, D.G. and Sharp, P.M. (1989) Fast and sensitive multiple sequence alignments on a
microcomputer CABIOS 5:151-153; Myers, E.W. and Muller W. (1988) Optimal alignments
in linear space CABIOS 4:11-17; Robinson, E.D. (1971) Comb. Theor 11:105; Saitou, N. and
Nei, M. (1987) The neighbor joining method. A new method for reconstructing phylogenetic
trees Mol. Biol. Evol. 4:406-425; Sneath, P.H.A. and Sokal, R.R. (1973) Numerical
Taxonomy - the Principles and Practice of Numerical Taxonomy, Freeman Press, San
Francisco, CA; Wilbur, W.J. and Lipman, D.J. (1983) Rapid similarity searches of nucleic
acid and protein data banks Proc. Natl. Acad. Sci. USA 80:726-730.

Preferably, the "percentage of sequence identity" is determined by comparing 
two optimally aligned sequences over a window of comparison of at least 20 positions, 
wherein the portion of the polynucleotide sequence in the comparison window may comprise
additions or deletions (i.e., gaps) of 20 percent or less, usually 5 to 15 percent, or 10 to 12
percent, as compared to the reference sequence (which does not comprise additions or
deletions) for optimal alignment of the two sequences. The percentage is calculated by
determining the number of positions at which the identical nucleic acid bases or amino acid
residue occurs in both sequences to yield the number of matched positions, dividing the
number of matched positions by the total number of positions in the reference sequence (i.e.,
the window size) and multiplying the results by 100 to yield the percentage of sequence 
identity. 
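
The calculation just described can be written out directly. The following is a minimal sketch, not the Megalign implementation: it assumes the two sequences have already been optimally aligned by some other tool, that gaps in the compared sequence are written as "-", and that the reference contains no gaps, as the text states. The function name and the hard minimum of 20 positions are illustrative choices.

```python
def percent_identity(aligned_query: str, aligned_reference: str, gap: str = "-") -> float:
    """Percentage of sequence identity over one comparison window.

    Both strings must cover the same window of an existing alignment.
    The reference is assumed to be gap-free, so its length is the window size.
    """
    if len(aligned_query) != len(aligned_reference):
        raise ValueError("aligned sequences must have the same length")
    if gap in aligned_reference:
        raise ValueError("reference sequence must not contain gaps")
    window_size = len(aligned_reference)
    if window_size < 20:
        raise ValueError("comparison window must span at least 20 positions")
    # Count positions where the identical base or residue occurs in both sequences.
    matched = sum(
        1
        for q, r in zip(aligned_query.upper(), aligned_reference.upper())
        if q == r and q != gap
    )
    # Divide by the window size and multiply by 100.
    return 100.0 * matched / window_size
```

In this sketch a gap position simply counts as a non-match. The separate limit on the proportion of gaps in the window (20 percent or less) is not enforced and would need its own check.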

Also included in the scope of the present invention are alleles of the genes encoding 
the nucleotide sequences recited herein. As used herein, an "allele" or "allelic sequence" is
an alternative form of the gene which may result from at least one mutation in the nucleic
acid sequence. Alleles may result in altered mRNAs or polypeptides whose structure or
function may or may not be altered. Any given gene may have none, one, or many allelic
forms. Common mutational changes which give rise to alleles are generally ascribed to
natural deletions, additions, or substitutions of nucleotides. Each of these types of changes
may occur alone or in combination with the others, one or more times in a given sequence. 

For breast tumor polypeptides with immunoreactive properties, variants may,
alternatively, be identified by modifying the amino acid sequence of one of the above 
polypeptides, and evaluating the immunoreactivity of the modified polypeptide. For breast 
tumor polypeptides useful for the generation of diagnostic binding agents, a variant may be 
identified by evaluating a modified polypeptide for the ability to generate antibodies that
detect the presence or absence of breast cancer. Such modified sequences may be prepared
and tested using, for example, the representative procedures described herein. 

The breast tumor proteins of the present invention, and polynucleotide 
molecules encoding such proteins, may be isolated from breast tumor tissue using any of a
variety of methods well known in the art. Polynucleotide sequences corresponding to a gene 
(or a portion thereof) encoding one of the inventive breast tumor proteins may be isolated 
from a breast tumor cDNA library using a subtraction technique as described in detail below. 
Examples of such DNA sequences are provided in SEQ ID NOS: 1-94. Partial
polynucleotide sequences thus obtained may be used to design oligonucleotide primers for 
the amplification of full-length polynucleotide sequences in a polymerase chain reaction 
(PCR), using techniques well known in the art (see, for example, Mullis et al., Cold Spring
Harbor Symp. Quant. Biol., 51:263, 1987; Erlich ed., PCR Technology, Stockton Press, NY,
1989). Once a polynucleotide sequence encoding a polypeptide is obtained, any of the above
modifications may be readily introduced using standard mutagenesis techniques, such as
oligonucleotide-directed site-specific mutagenesis as taught, for example, by Adelman et al.
(DNA, 2:183, 1983).

The breast tumor polypeptides disclosed herein may also be generated by 
synthetic or recombinant means. Synthetic polypeptides having fewer than about 100 amino 
acids, and generally fewer than about 50 amino acids, may be generated using techniques 
well known to those of ordinary skill in the art. For example, such polypeptides may be 
synthesized using any of the commercially available solid-phase techniques, such as the 
Merrifield solid-phase synthesis method, where amino acids are sequentially added to a 
growing amino acid chain (see, for example, Merrifield, J. Am. Chem. Soc. 85:2149-2154,
1963). Equipment for automated synthesis of polypeptides is commercially available from 
suppliers such as Perkin Elmer/Applied BioSystems Division (Foster City, CA), and may be 
operated according to the manufacturer's instructions. 

Alternatively, any of the above polypeptides may be produced recombinantly
by inserting a polynucleotide sequence that encodes the polypeptide into an expression 
vector and expressing the protein in an appropriate host. Any of a variety of expression 
vectors known to those of ordinary skill in the art may be employed to express recombinant
polypeptides of this invention. Expression may be achieved in any appropriate host cell that 
has been transformed or transfected with an expression vector containing a polynucleotide 
molecule that encodes a recombinant polypeptide. Suitable host cells include prokaryotes, 
yeast and higher eukaryotic cells. Preferably, the host cells employed are E. coli, yeast or a
mammalian cell line, such as CHO cells. The polynucleotide sequences expressed in this
manner may encode naturally occurring polypeptides, portions of naturally occurring
polypeptides, or other variants thereof. 

In general, regardless of the method of preparation, the polypeptides disclosed 
herein are prepared in an isolated, substantially pure form (i.e., the polypeptides are
homogeneous as determined by amino acid composition and primary sequence analysis).
Preferably, the polypeptides are at least about 90% pure, more preferably at least about 95%
pure and most preferably at least about 99% pure. In certain preferred embodiments,
described in more detail below, the substantially pure polypeptides are incorporated into
pharmaceutical compositions or vaccines for use in one or more of the methods disclosed 
herein. 

In a related aspect, the present invention provides fusion proteins comprising a
first and a second inventive polypeptide or, alternatively, a polypeptide of the present 
invention and a known breast tumor antigen, together with variants of such fusion proteins. 

A polynucleotide sequence encoding a fusion protein of the present invention
is constructed using known recombinant DNA techniques to assemble separate
polynucleotide sequences encoding the first and second polypeptides into an appropriate
expression vector. The 3' end of a polynucleotide sequence encoding the first polypeptide is
ligated, with or without a peptide linker, to the 5' end of a polynucleotide sequence encoding
the second polypeptide so that the reading frames of the sequences are in phase to permit
mRNA translation of the two DNA sequences into a single fusion protein that retains the
biological activity of both the first and the second polypeptides.
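
The requirement that the reading frames be "in phase" can be illustrated in silico. The sketch below is a simplified model under stated assumptions: each input is a coding sequence on the sense strand starting at its first codon, the linker (if any) is given as DNA, and a terminal stop codon on the first sequence is removed before joining. The names `build_fusion` and `strip_terminal_stop` are hypothetical, and the sketch says nothing about vectors, regulatory elements or retention of biological activity.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def codons(seq: str) -> list:
    """Split a sequence into consecutive three-nucleotide codons."""
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


def strip_terminal_stop(cds: str) -> str:
    """Remove a stop codon from the 3' end of a coding sequence, if present."""
    return cds[:-3] if cds[-3:] in STOP_CODONS else cds


def build_fusion(first_cds: str, second_cds: str, linker: str = "") -> str:
    """Join first_cds (3' end) to second_cds (5' end), optionally via a linker,
    and verify that the result is a single in-frame open reading frame."""
    first = strip_terminal_stop(first_cds.upper())
    second = second_cds.upper()
    linker = linker.upper()
    for name, part in (("first", first), ("linker", linker), ("second", second)):
        # Each part must be a whole number of codons to keep the frames in phase.
        if len(part) % 3 != 0:
            raise ValueError(f"{name} sequence length is not a multiple of three")
    fused = first + linker + second
    # A stop codon anywhere before the last codon would truncate the fusion.
    if any(codon in STOP_CODONS for codon in codons(fused)[:-1]):
        raise ValueError("in-frame stop codon before the end of the fusion")
    return fused
```

With toy inputs such as `build_fusion("ATGGCTTAA", "ATGAAATGA", linker="GGTTCT")`, the stop codon of the first sequence is dropped and the Gly-Ser linker codons keep the second sequence in frame. Real constructs would of course be verified by sequencing.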

A peptide linker sequence may be employed to separate the first and the
second polypeptides by a distance sufficient to ensure that each polypeptide folds into its
secondary and tertiary structures. Such a peptide linker sequence is incorporated into the 
fusion protein using standard techniques well known in the art. Suitable peptide linker 
sequences may be chosen based on the following factors: (1) their ability to adopt a flexible
extended conformation; (2) their inability to adopt a secondary structure that could interact 
with functional epitopes on the first and second polypeptides; and (3) the lack of hydrophobic 
or charged residues that might react with the polypeptide functional epitopes. Preferred 
peptide linker sequences contain Gly, Asn and Ser residues. Other near neutral amino acids, 
such as Thr and Ala, may also be used in the linker sequence. Amino acid sequences which
may be usefully employed as linkers include those disclosed in Maratea et al., Gene 40:39-46,
1985; Murphy et al., Proc. Natl. Acad. Sci. USA 83:8258-8262, 1986; U.S. Patent
No. 4,935,233 and U.S. Patent No. 4,751,180. The linker sequence may be from 1 to about
50 amino acids in length. Peptide sequences are not required when the first and second
polypeptides have non-essential N-terminal amino acid regions that can be used to separate 
the functional domains and prevent steric interference.

The ligated polynucleotide sequences are operably linked to suitable
transcriptional or translational regulatory elements. The regulatory elements responsible for
expression of polynucleotides are located only 5' to the polynucleotide sequence encoding
the first polypeptide. Similarly, stop codons required to end translation and transcription
termination signals are only present 3' to the polynucleotide sequence encoding the second
polypeptide.

Fusion proteins are also provided that comprise a polypeptide of the present
invention together with an unrelated immunogenic protein. Preferably the immunogenic 
protein is capable of eliciting a recall response. Examples of such proteins include tetanus, 
tuberculosis and hepatitis proteins (see, for example, Stoute et al., New Engl. J. Med.,
336:86-91 (1997)).

Polypeptides of the present invention that comprise an immunogenic portion 
of a breast tumor protein may generally be used for immunotherapy of breast cancer, wherein
the polypeptide stimulates the patient's own immune response to breast tumor cells. In
further aspects, the present invention provides methods for using one or more of the
immunoreactive polypeptides encoded by a polynucleotide molecule having a sequence
provided in SEQ ID NOS: 1-94 (or fusion proteins comprising one or more such
polypeptides and/or polynucleotides encoding such polypeptides) for immunotherapy of
breast cancer in a patient. As used herein, a "patient" refers to any warm-blooded animal,
preferably a human. A patient may be afflicted with a disease, or may be free of detectable
disease. Accordingly, the above immunoreactive polypeptides (or fusion proteins or
polynucleotide molecules encoding such polypeptides) may be used to treat breast cancer or 
to inhibit the development of breast cancer. The polypeptides may be administered either 
prior to or following surgical removal of primary tumors and/or treatment by administration 
of radiotherapy and conventional chemotherapeutic drugs. 

In these aspects, the polypeptide or fusion protein is generally present within a 
pharmaceutical composition and/or a vaccine. Pharmaceutical compositions may comprise 
one or more polypeptides, each of which may contain one or more of the above sequences (or 
variants thereof), and a physiologically acceptable carrier. The vaccines may comprise one or 
more of such polypeptides and a non-specific immune response enhancer, wherein the
non-specific immune response enhancer is capable of eliciting or enhancing an immune response
to an exogenous antigen. Examples of non-specific immune response enhancers include
adjuvants, biodegradable microspheres (e.g., polylactic galactide) and liposomes (into which
the polypeptide is incorporated). Pharmaceutical compositions and vaccines may also contain
other epitopes of breast tumor antigens, either incorporated into a combination polypeptide
(i.e., a single polypeptide that contains multiple epitopes) or present within a separate
polypeptide. 

Alternatively, a pharmaceutical composition or vaccine may contain 
polynucleotides encoding one or more of the above polypeptides, such that the polypeptide is
generated in situ. In such pharmaceutical compositions and vaccines, the polynucleotide
may be present within any of a variety of delivery systems known to those of ordinary skill in
the art, including nucleic acid expression systems, bacteria and viral expression systems.
Appropriate nucleic acid expression systems contain the necessary polynucleotide sequences 
for expression in the patient (such as a suitable promoter). Bacterial delivery systems involve 
the administration of a bacterium (such as Bacillus Calmette-Guerin) that expresses an
epitope of a breast tumor cell antigen on its cell surface. In a preferred embodiment, the 
polynucleotide molecules may be introduced using a viral expression system (e.g., vaccinia
or other pox virus, retrovirus, or adenovirus), which may involve the use of a non-pathogenic
(defective), replication competent virus. Suitable systems are disclosed, for example, in
Fisher-Hoch et al., PNAS 86:317-321, 1989; Flexner et al., Ann. N.Y. Acad. Sci. 569:86-103,
1989; Flexner et al., Vaccine 8:17-21, 1990; U.S. Patent Nos. 4,603,112, 4,769,330, and
5,017,487; WO 89/01973; U.S. Patent No. 4,777,127; GB 2,200,651; EP 0,345,242;
WO 91/02805; Berkner, Biotechniques 6:616-627, 1988; Rosenfeld et al., Science
252:431-434, 1991; Kolls et al., PNAS 91:215-219, 1994; Kass-Eisler et al., PNAS
90:11498-11502, 1993; Guzman et al., Circulation 88:2838-2848, 1993; and Guzman et al.,
Cir. Res. 73:1202-1207, 1993. Techniques for incorporating polynucleotides into such
expression systems are well known to those of ordinary skill in the art. The polynucleotides
may also be "naked," as described, for example, in published PCT application WO 90/11092,
and Ulmer et al., Science 259:1745-1749, 1993, reviewed by Cohen, Science 259:1691-1692,
1993. The uptake of naked polynucleotides may be increased by coating the polynucleotides
onto biodegradable beads, which are efficiently transported into the cells.

Routes and frequency of administration, as well as dosage, will vary from 
individual to individual and may parallel those currently being used in immunotherapy of 
other diseases. In general, the pharmaceutical compositions and vaccines may be 
administered by injection (e.g., intracutaneous, intramuscular, intravenous or subcutaneous),
intranasally (e.g., by aspiration) or orally. Between 1 and 10 doses may be administered over
a 3-24 week period. Preferably, 4 doses are administered, at an interval of 3 months, and
booster administrations may be given periodically thereafter. Alternate protocols may be 
appropriate for individual patients. A suitable dose is an amount of polypeptide or 
polynucleotide molecule that is effective to raise an immune response (cellular and/or 
humoral) against breast tumor cells in a treated patient. A suitable immune response is at
least 10-50% above the basal (i.e., untreated) level. In general, the amount of polypeptide
present in a dose (or produced in situ by the polynucleotide in a dose) ranges from about 1 pg
to about 100 mg per kg of host, typically from about 10 pg to about 1 mg, and preferably
from about 100 pg to about 1 ng. Suitable dose sizes will vary with the size of the patient,
but will typically range from about 0.01 mL to about 5 mL. 

While any suitable carrier known to those of ordinary skill in the art may be 
employed in the pharmaceutical compositions of this invention, the type of carrier will vary 
depending on the mode of administration. For parenteral administration, such as 
subcutaneous injection, the carrier preferably comprises water, saline, alcohol, a lipid, a wax
and/or a buffer. For oral administration, any of the above carriers or a solid carrier, such as
mannitol, lactose, starch, magnesium stearate, sodium saccharine, talcum, cellulose, glucose,
sucrose, and/or magnesium carbonate, may be employed. Biodegradable microspheres (e.g., 
polylactic glycolide) may also be employed as carriers for the pharmaceutical compositions 
of this invention. Suitable biodegradable microspheres are disclosed, for example, in U.S. 
Patent Nos. 4,897,268 and 5,075,109.

Any of a variety of non-specific immune response enhancers may be employed 
in the vaccines of this invention. For example, an adjuvant may be included. Most adjuvants 
contain a substance designed to protect the antigen from rapid catabolism, such as aluminum
hydroxide or mineral oil, and a nonspecific stimulator of immune response, such as lipid A,
Bordetella pertussis or Mycobacterium tuberculosis. Such adjuvants are commercially
available as, for example, Freund's Incomplete Adjuvant and Complete Adjuvant (Difco 
Laboratories, Detroit, MI) and Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ). 

Polypeptides disclosed herein may also be employed in adoptive
immunotherapy for the treatment of cancer. Adoptive immunotherapy may be broadly
classified into either active or passive immunotherapy. In active immunotherapy, treatment
relies on the in vivo stimulation of the endogenous host immune system to react against
tumors with the administration of immune response-modifying agents (for example, tumor
vaccines, bacterial adjuvants, and/or cytokines). 

In passive immunotherapy, treatment involves the delivery of biologic
reagents with established tumor-immune reactivity (such as effector cells or antibodies) that
can directly or indirectly mediate antitumor effects and does not necessarily depend on an
intact host immune system. Examples of effector cells include T lymphocytes (for example,
CD8+ cytotoxic T-lymphocyte, CD4+ T-helper, tumor-infiltrating lymphocytes), killer cells
(such as Natural Killer cells, lymphokine-activated killer cells), B cells, or antigen presenting
cells (such as dendritic cells and macrophages) expressing the disclosed antigens. The
polypeptides disclosed herein may also be used to generate antibodies or anti-idiotypic
antibodies (as in U.S. Patent No. 4,918,164), for passive immunotherapy.

The predominant method of procuring adequate numbers of T-cells for 
adoptive immunotherapy is to grow immune T-cells in vitro. Culture conditions for
expanding single antigen-specific T-cells to several billion in number with retention of 
antigen recognition in vivo are well known in the art. These in vitro culture conditions 
typically utilize intermittent stimulation with antigen, often in the presence of cytokines, such 
as IL-2, and non-dividing feeder cells. As noted above, the immunoreactive polypeptides 
described herein may be used to rapidly expand antigen-specific T cell cultures in order to
generate a sufficient number of cells for immunotherapy. In particular, antigen-presenting
cells, such as dendritic, macrophage or B-cells, may be pulsed with immunoreactive 
polypeptides or transfected with a polynucleotide sequence(s), using standard techniques well 
known in the art. For example, antigen presenting cells may be transfected with a 
polynucleotide sequence, wherein said sequence contains a promoter region appropriate for
increasing expression, and can be expressed as part of a recombinant virus or other expression
system. For cultured T-cells to be effective in therapy, the cultured T-cells must be able to
grow and distribute widely and to survive long term in vivo. Studies have demonstrated that
cultured T-cells can be induced to grow in vivo and to survive long term in substantial
numbers by repeated stimulation with antigen supplemented with IL-2 (see, for example,
Cheever, M., et al., "Therapy With Cultured T Cells: Principles Revisited," Immunological
Reviews, 157:177, 1997).

The polypeptides disclosed herein may also be employed to generate and/or 
isolate tumor-reactive T-cells, which can then be administered to the patient. In one
technique, antigen-specific T-cell lines may be generated by in vivo immunization with short
peptides corresponding to immunogenic portions of the disclosed polypeptides. The resulting
antigen-specific CD8+ CTL clones may be isolated from the patient, expanded using standard
tissue culture techniques, and returned to the patient. 

Alternatively, peptides corresponding to immunogenic portions of the
polypeptides may be employed to generate tumor reactive T cell subsets by selective in vitro
stimulation and expansion of autologous T cells to provide antigen-specific T cells which
may be subsequently transferred to the patient as described, for example, by Chang et al.
(Crit. Rev. Oncol. Hematol., 22(3), 213, 1996). Cells of the immune system, such as T cells,
may be isolated from the peripheral blood of a patient, using a commercially available cell
separation system, such as CellPro Incorporated's (Bothell, WA) CEPRATE™ system (see
U.S. Patent No. 5,240,856; U.S. Patent No. 5,215,926; WO 89/06280; WO 91/16116 and WO
92/07243). The separated cells are stimulated with one or more of the immunoreactive
polypeptides contained within a delivery vehicle, such as a microsphere, to provide
antigen-specific T cells. The population of tumor antigen-specific T cells is then expanded using
standard techniques and the cells are administered back to the patient. 

In another embodiment, T-cell and/or antibody receptors specific for the 
polypeptides can be cloned, expanded, and transferred into other vectors or effector cells for 
use in adoptive immunotherapy. 

In a further embodiment, syngeneic or autologous dendritic cells may be
pulsed with peptides corresponding to at least an immunogenic portion of a polypeptide
disclosed herein. The resulting antigen-specific dendritic cells may either be transferred into
a patient, or employed to stimulate T cells to provide antigen-specific T cells which may, in
turn, be administered to a patient. The use of peptide-pulsed dendritic cells to generate
antigen-specific T cells, and the subsequent use of such antigen-specific T cells to eradicate
tumors in a murine model has been demonstrated by Cheever et al. (Immunological Reviews,
157:177, 1997).

Additionally, vectors expressing the disclosed polynucleotides may be introduced into
stem cells taken from the patient and clonally propagated in vitro for autologous transplant
back into the same patient. 

Polypeptides of the present invention may also, or alternatively, be used to 
generate binding agents, such as antibodies or fragments thereof, that are capable of detecting 
metastatic human breast tumors. Binding agents of the present invention may generally be 
prepared using methods known to those of ordinary skill in the art, including the
representative procedures described herein. Binding agents are capable of differentiating 
between patients with and without breast cancer, using the representative assays described 
herein. In other words, antibodies or other binding agents raised against a breast tumor
protein, or a suitable portion thereof, will generate a signal indicating the presence of primary
or metastatic breast cancer in at least about 20% of patients afflicted with the disease, and will
generate a negative signal indicating the absence of the disease in at least about 90% of
individuals without primary or metastatic breast cancer. Suitable portions of such breast
tumor proteins are portions that are able to generate a binding agent that indicates the
presence of primary or metastatic breast cancer in substantially all (i.e., at least about 80%,
and preferably at least about 90%) of the patients for which breast cancer would be indicated
using the full length protein, and that indicate the absence of breast cancer in substantially all
of those samples that would be negative when tested with full length protein. The
representative assays described below, such as the two-antibody sandwich assay, may
generally be employed for evaluating the ability of a binding agent to detect metastatic human
breast tumors.

The ability of a polypeptide prepared as described herein to generate
antibodies capable of detecting primary or metastatic human breast tumors may generally be
evaluated by raising one or more antibodies against the polypeptide (using, for example, a
representative method described herein) and determining the ability of such antibodies to
detect such tumors in patients. This determination may be made by assaying biological 
samples from patients with and without primary or metastatic breast cancer for the presence 
of a polypeptide that binds to the generated antibodies. Such test assays may be performed, 
for example, using a representative procedure described below. Polypeptides that generate
antibodies capable of detecting at least 20% of primary or metastatic breast tumors by such
procedures are considered to be useful in assays for detecting primary or metastatic human
breast tumors. Polypeptide-specific antibodies may be used alone or in combination to
improve sensitivity. 
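
By way of illustration only, the detection criteria recited above (a positive signal in at least about 20% of afflicted patients and a negative signal in at least about 90% of unaffected individuals) may be checked against assay calls as in the following minimal sketch. The function name, the argument names and the representation of assay results as true/false calls are assumptions made for this example and are not part of the disclosure.

```python
def meets_detection_criteria(patient_calls, control_calls,
                             min_sensitivity=0.20, min_specificity=0.90):
    """Evaluate a binding agent from per-sample assay calls.

    patient_calls: True/False calls for samples from afflicted patients
                   (True = positive signal).
    control_calls: True/False calls for samples from unaffected individuals.
    Returns (sensitivity, specificity, passes).
    """
    if not patient_calls or not control_calls:
        raise ValueError("both sample groups must be non-empty")
    # Fraction of afflicted patients giving a positive signal.
    sensitivity = sum(bool(c) for c in patient_calls) / len(patient_calls)
    # Fraction of unaffected individuals giving a negative signal.
    specificity = sum(not c for c in control_calls) / len(control_calls)
    passes = sensitivity >= min_sensitivity and specificity >= min_specificity
    return sensitivity, specificity, passes


# Hypothetical calls: 1 of 4 patients positive, 1 of 10 controls positive.
patients = [True, False, False, False]
controls = [False] * 9 + [True]
print(meets_detection_criteria(patients, controls))
```

The two thresholds are exposed as parameters so that stricter figures may be substituted where a particular application requires them.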

Polypeptides capable of detecting primary or metastatic human breast tumors
may be used as markers for diagnosing breast cancer or for monitoring disease progression in
patients. In one embodiment, breast cancer in a patient may be diagnosed by evaluating a 
biological sample obtained from the patient for the level of one or more of the above 
polypeptides, relative to a predetermined cut-off value. As used herein, suitable "biological 
samples" include blood, sera and urine. 

The level of one or more of the above polypeptides may be evaluated using 
any binding agent specific for the polypeptide(s). A "binding agent," in the context of this 
invention, is any agent (such as a compound or a cell) that binds to a polypeptide as described 
above. As used herein, "binding" refers to a noncovalent association between two separate 
molecules (each of which may be free (i.e., in solution) or present on the surface of a cell or a 
solid support), such that a "complex" is formed. Such a complex may be free or immobilized 
(either covalently or noncovalently) on a support material. The ability to bind may generally
be evaluated by determining a binding constant for the formation of the complex. The 
binding constant is the value obtained when the concentration of the complex is divided by 
the product of the component concentrations. In general, two compounds are said to "bind" 
in the context of the present invention when the binding constant for complex formation 
exceeds about 10³ L/mol. The binding constant may be determined using methods well
known to those of ordinary skill in the art. 
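
The definition of the binding constant given above, namely the concentration of the complex divided by the product of the component concentrations, may be illustrated by the following minimal sketch. The function names are hypothetical, concentrations are assumed to be expressed in mol/L so that the result is in L/mol, and the default threshold of 1e3 L/mol is an assumption of this example that may be changed by the caller.

```python
def binding_constant(complex_conc, conc_a, conc_b):
    """Return K = [AB] / ([A] * [B]) in L/mol for concentrations in mol/L."""
    if conc_a <= 0 or conc_b <= 0:
        raise ValueError("component concentrations must be positive")
    if complex_conc < 0:
        raise ValueError("complex concentration cannot be negative")
    return complex_conc / (conc_a * conc_b)


def binds(complex_conc, conc_a, conc_b, threshold=1e3):
    """True when the binding constant exceeds the threshold (assumed, L/mol)."""
    return binding_constant(complex_conc, conc_a, conc_b) > threshold


# Hypothetical equilibrium concentrations, all 1 micromolar: K is about 1e6 L/mol.
print(binding_constant(1e-6, 1e-6, 1e-6))
print(binds(1e-6, 1e-6, 1e-6))
```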

Any agent that satisfies the above requirements may be a binding agent. For 
example, a binding agent may be a ribosome with or without a peptide component, an RNA 
molecule or a peptide. In a preferred embodiment, the binding partner is an antibody, or a 
fragment thereof. Such antibodies may be polyclonal or monoclonal. In addition, the
antibodies may be single chain, chimeric, CDR-grafted or humanized. Antibodies may be
prepared by the methods described herein and by other methods well known to those of skill 
in the art. 

There are a variety of assay formats known to those of ordinary skill in the art 
for using a binding partner to detect polypeptide markers in a sample. See, e.g., Harlow and 
Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. In a
preferred embodiment, the assay involves the use of a binding partner immobilized on a solid
support to bind to and remove the polypeptide from the remainder of the sample. The bound
polypeptide may then be detected using a second binding partner that contains a reporter
group. Suitable second binding partners include antibodies that bind to the binding
partner/polypeptide complex. Alternatively, a competitive assay may be utilized, in which a
polypeptide is labeled with a reporter group and allowed to bind to the immobilized binding
partner after incubation of the binding partner with the sample. The extent to which 
components of the sample inhibit the binding of the labeled polypeptide to the binding 
partner is indicative of the reactivity of the sample with the immobilized binding partner. 
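
The extent of inhibition in such a competitive assay is commonly expressed as a percentage of the signal obtained in the absence of sample. The following sketch shows one such calculation; the formula, the function name and the example signal values are assumptions offered for illustration and are not prescribed by this description.

```python
def percent_inhibition(signal_with_sample, signal_without_sample):
    """Percent reduction in labeled-polypeptide binding caused by the sample.

    signal_with_sample:    reporter signal after pre-incubation with the sample.
    signal_without_sample: reporter signal with no sample present (reference).
    """
    if signal_without_sample <= 0:
        raise ValueError("reference signal must be positive")
    return 100.0 * (1.0 - signal_with_sample / signal_without_sample)


# Hypothetical readings: reference signal 1000 units, 250 units with sample.
print(percent_inhibition(250.0, 1000.0))
```

A higher percentage indicates that components of the sample occupied more of the immobilized binding partner before the labeled polypeptide was added.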

The solid support may be any material known to those of ordinary skill in the 
art to which the antigen may be attached. For example, the solid support may be a test well in 
a microtiter plate or a nitrocellulose or other suitable membrane. Alternatively, the support 
may be a bead or disc, such as glass, fiberglass, latex or a plastic material such as polystyrene 
or polyvinylchloride. The support may also be a magnetic particle or a fiber optic sensor, 
such as those disclosed, for example, in U.S. Patent No. 5,359,681. The binding agent may 
be immobilized on the solid support using a variety of techniques known to those of skill in
the art, which are amply described in the patent and scientific literature. In the context of the 
present invention, the term "immobilization" refers to both noncovalent association, such as 
adsorption, and covalent attachment (which may be a direct linkage between the antigen and
functional groups on the support or may be a linkage by way of a cross-linking agent).
Immobilization by adsorption to a well in a microtiter plate or to a membrane is preferred. In
such cases, adsorption may be achieved by contacting the binding agent, in a suitable buffer,
with the solid support for a suitable amount of time. The contact time varies with
temperature, but is typically between about 1 hour and about 1 day. In general, contacting a
well of a plastic microtiter plate (such as polystyrene or polyvinylchloride) with an amount of
binding agent ranging from about 10 ng to about 10 µg, and preferably about 100 ng to about
1 µg, is sufficient to immobilize an adequate amount of binding agent.

Covalent attachment of binding agent to a solid support may generally be 
achieved by first reacting the support with a bifunctional reagent that will react with both the
support and a functional group, such as a hydroxyl or amino group, on the binding agent. For
example, the binding agent may be covalently attached to supports having an appropriate
polymer coating using benzoquinone or by condensation of an aldehyde group on the support
with an amine and an active hydrogen on the binding partner (see, e.g., Pierce
Immunotechnology Catalog and Handbook, 1991, at A12-A13).

In certain embodiments, the assay is a two-antibody sandwich assay. This 
assay may be performed by first contacting an antibody that has been immobilized on a solid 
support, commonly the well of a microtiter plate, with the sample, such that polypeptides 
within the sample are allowed to bind to the immobilized antibody. Unbound sample is then 
removed from the immobilized polypeptide-antibody complexes and a second antibody 
(containing a reporter group) capable of binding to a different site on the polypeptide is 
added. The amount of second antibody that remains bound to the solid support is then 
determined using a method appropriate for the specific reporter group. 

More specifically, once the antibody is immobilized on the support as 
described above, the remaining protein binding sites on the support are typically blocked. 
Any suitable blocking agent known to those of ordinary skill in the art, such as bovine serum
albumin or Tween 20™ (Sigma Chemical Co., St. Louis, MO), may be employed. The
immobilized antibody is then incubated with the sample, and polypeptide is allowed to bind
to the antibody. The sample may be diluted with a suitable diluent, such as
phosphate-buffered saline (PBS) prior
to incubation. In general, an appropriate contact time (i.e., incubation time) is that period of
time that is sufficient to detect the presence of polypeptide within a sample obtained from an 
individual with breast cancer. Preferably, the contact time is sufficient to achieve a level of 
binding that is at least about 95% of that achieved at equilibrium between bound and unbound 
polypeptide. Those of ordinary skill in the art will recognize that the time necessary to 
achieve equilibrium may be readily determined by assaying the level of binding that occurs 
over a period of time. At room temperature, an incubation time of about 30 minutes is 
generally sufficient. 
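
The determination of a contact time from a binding time course, as described above, may be sketched as follows. The example assumes that the time course was run long enough for binding to plateau, so that the largest measured value approximates the equilibrium level; the function name and the sample data are hypothetical.

```python
def time_to_fraction_of_equilibrium(times, binding, fraction=0.95):
    """Return the earliest measured time at which binding reaches the given
    fraction of the plateau level.

    Assumption: the maximum measured binding approximates equilibrium.
    """
    if not times or len(times) != len(binding):
        raise ValueError("times and binding must be non-empty and equal length")
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    plateau = max(binding)
    if plateau <= 0:
        raise ValueError("no measurable binding in the time course")
    target = fraction * plateau
    # Scan in chronological order and stop at the first point meeting the target.
    for t, b in sorted(zip(times, binding)):
        if b >= target:
            return t


# Hypothetical time course (minutes, relative signal).
minutes = [5, 10, 20, 30, 60]
signal = [0.40, 0.70, 0.90, 0.96, 1.00]
print(time_to_fraction_of_equilibrium(minutes, signal))
```

Because only measured time points are considered, the returned value is no finer than the sampling interval of the time course.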

Unbound sample may then be removed by washing the solid support with an
appropriate buffer, such as PBS containing 0.1% Tween 20™. The second antibody, which
contains a reporter group, may then be added to the solid support. Preferred reporter groups
include enzymes (such as horseradish peroxidase), substrates, cofactors, inhibitors, dyes,
radionuclides, luminescent groups, fluorescent groups and biotin. The conjugation of
antibody to reporter group may be achieved using standard methods known to those of
ordinary skill in the art.

The second antibody is then incubated with the immobilized
antibody-polypeptide complex for an amount of time sufficient to detect the bound polypeptide. An
appropriate amount of time may generally be determined by assaying the level of binding that
occurs over a period of time. Unbound second antibody is then removed and bound second
antibody is detected using the reporter group. The method employed for detecting the
reporter group depends upon the nature of the reporter group. For radioactive groups,
scintillation counting or autoradiographic methods are generally appropriate. Spectroscopic
methods may be used to detect dyes, luminescent groups and fluorescent groups. Biotin may
be detected using avidin, coupled to a different reporter group (commonly a radioactive or
fluorescent group or an enzyme). Enzyme reporter groups may generally be detected by the
addition of substrate (generally for a specific period of time), followed by spectroscopic or 
other analysis of the reaction products. 

To determine the presence or absence of breast cancer, the signal detected 
from the reporter group that remains bound to the solid support is generally compared to a
signal that corresponds to a predetermined cut-off value. In one preferred embodiment, the
cut-off value is the mean signal obtained when the immobilized antibody is incubated
with samples from patients without breast cancer. In general, a sample generating a signal
that is three standard deviations above the predetermined cut-off value is considered positive
for breast cancer. In an alternate preferred embodiment, the cut-off value is determined using
a Receiver Operator Curve, according to the method of Sackett et al., Clinical Epidemiology:
A Basic Science for Clinical Medicine, Little Brown and Co., 1985, p. 106-7. Briefly, in this
embodiment, the cut-off value may be determined from a plot of pairs of true positive rates
(i.e., sensitivity) and false positive rates (100%-specificity) that correspond to each possible
cut-off value for the diagnostic test result. The cut-off value on the plot that is the closest to
the upper left-hand corner (i.e., the value that encloses the largest area) is the most accurate
cut-off value, and a sample generating a signal that is higher than the cut-off value 
determined by this method may be considered positive. Alternatively, the cut-off value may 
be shifted to the left along the plot, to minimize the false positive rate, or to the right, to 
minimize the false negative rate. In general, a sample generating a signal that is higher than 
the cut-off value determined by this method is considered positive for breast cancer. 
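
The two cut-off embodiments described above may be illustrated by the following minimal sketch. Several details are assumptions of this example rather than requirements of the method: the sample standard deviation is used, candidate cut-offs are taken from the observed signal values, and "closest to the upper left-hand corner" is interpreted as the smallest Euclidean distance to the point at which the false positive rate is 0 and the true positive rate is 1. All names and data are hypothetical.

```python
from math import hypot
from statistics import mean, stdev


def is_positive_3sd(signal, control_signals):
    """First embodiment: positive when the signal is at least three standard
    deviations above the mean signal of samples from unaffected individuals."""
    cutoff = mean(control_signals)
    return signal >= cutoff + 3 * stdev(control_signals)


def roc_cutoff(patient_signals, control_signals):
    """Second embodiment: choose the cut-off whose (false positive rate,
    true positive rate) pair lies closest to the upper left-hand corner."""
    candidates = sorted(set(patient_signals) | set(control_signals))
    best_cutoff, best_distance = None, None
    for c in candidates:
        # A signal higher than the cut-off is scored as positive.
        tpr = sum(s > c for s in patient_signals) / len(patient_signals)
        fpr = sum(s > c for s in control_signals) / len(control_signals)
        distance = hypot(fpr, 1.0 - tpr)
        if best_distance is None or distance < best_distance:
            best_cutoff, best_distance = c, distance
    return best_cutoff


# Hypothetical reporter signals.
controls = [0.2, 0.3, 0.4, 0.5, 1.0]
patients = [0.9, 1.2, 1.5, 2.0]
print(is_positive_3sd(1.5, controls))
print(roc_cutoff(patients, controls))
```

Shifting the selected cut-off toward a lower false positive rate or a lower false negative rate, as described above, corresponds to choosing a different candidate from the same list.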

In a related embodiment, the assay is performed in a flow-through or strip test 
format, wherein the antibody is immobilized on a membrane, such as nitrocellulose. In the 
flow-through test, polypeptides within the sample bind to the immobilized antibody as the 
sample passes through the membrane. A second, labeled antibody then binds to the
antibody-polypeptide complex as a solution containing the second antibody flows through the
membrane. The detection of bound second antibody may then be performed as described
above. In the strip test format, one end of the membrane to which antibody is bound is 
immersed in a solution containing the sample. The sample migrates along the membrane 
through a region containing second antibody and to the area of immobilized antibody. 
Concentration of second antibody at the area of immobilized antibody indicates the presence 
of breast cancer. Typically, the concentration of second antibody at that site generates a 
pattern, such as a line, that can be read visually. The absence of such a pattern indicates a
negative result. In general, the amount of antibody immobilized on the membrane is selected
to generate a visually discernible pattern when the biological sample contains a level of
polypeptide that would be sufficient to generate a positive signal in the two-antibody
sandwich assay, in the format discussed above. Preferably, the amount of antibody
immobilized on the membrane ranges from about 25 ng to about 1 µg, and more preferably
from about 50 ng to about 500 ng. Such tests can typically be performed with a very small 
amount of biological sample. 

Of course, numerous other assay protocols exist that are suitable for use with 
the antigens or antibodies of the present invention. The above descriptions are intended to be 
exemplary only. 

In another embodiment, the above polypeptides may be used as markers for 
the progression of breast cancer. In this embodiment, assays as described above for the 
diagnosis of breast cancer may be performed over time, and the change in the level of reactive
polypeptide(s) evaluated. For example, the assays may be performed every 24-72 hours for a 
period of 6 months to 1 year, and thereafter performed as needed. In general, breast cancer is 
progressing in those patients in whom the level of polypeptide detected by the binding agent 
increases over time. In contrast, breast cancer is not progressing when the level of reactive 
polypeptide either remains constant or decreases with time.
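
One way to evaluate the change in polypeptide level across serial assays is to fit a straight line to the measurements and examine the sign of its slope. The following sketch uses an ordinary least-squares slope; this choice of trend measure, the tolerance parameter, the function names and the data are assumptions made for illustration only.

```python
def level_trend(times, levels):
    """Least-squares slope of polypeptide level versus time."""
    n = len(times)
    if n < 2 or n != len(levels):
        raise ValueError("need at least two paired measurements")
    t_mean = sum(times) / n
    l_mean = sum(levels) / n
    denom = sum((t - t_mean) ** 2 for t in times)
    if denom == 0:
        raise ValueError("measurements must span more than one time point")
    numer = sum((t - t_mean) * (l - l_mean) for t, l in zip(times, levels))
    return numer / denom


def is_progressing(times, levels, tolerance=0.0):
    """True when the level increases over time; a constant or decreasing
    level is treated as not progressing. The tolerance (assumed) allows
    small positive slopes due to assay noise to be disregarded."""
    return level_trend(times, levels) > tolerance


# Hypothetical serial measurements (days, relative level).
days = [0, 2, 4, 6]
print(level_trend(days, [1.0, 1.5, 2.0, 2.5]))
print(is_progressing(days, [2.0, 2.0, 2.0, 2.0]))
```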

Antibodies for use in the above methods may be prepared by any of a variety 
of techniques known to those of ordinary skill in the art. See, e.g., Harlow and Lane, 
Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. In one such 
technique, an immunogen comprising the antigenic polypeptide is initially injected into any 
of a wide variety of mammals (e.g., mice, rats, rabbits, sheep and goats). In this step, the
polypeptides of this invention may serve as the immunogen without modification. 
Alternatively, particularly for relatively short polypeptides, a superior immune response may 
be elicited if the polypeptide is joined to a carrier protein, such as bovine serum albumin or 
keyhole limpet hemocyanin. The immunogen is injected into the animal host, preferably
according to a predetermined schedule incorporating one or more booster immunizations, and 
the animals are bled periodically. Polyclonal antibodies specific for the polypeptide may then 
be purified from such antisera by, for example, affinity chromatography using the polypeptide 
coupled to a suitable solid support. 

Monoclonal antibodies specific for the antigenic polypeptide of interest may 
be prepared, for example, using the technique of Kohler and Milstein, Eur. J. Immunol.
6:511-519, 1976, and improvements thereto. Briefly, these methods involve the preparation
of immortal cell lines capable of producing antibodies having the desired specificity (i.e.,
reactivity with the polypeptide of interest). Such cell lines may be produced, for example,
from spleen cells obtained from an animal immunized as described above. The spleen cells
are then immortalized by, for example, fusion with a myeloma cell fusion partner, preferably
one that is syngeneic with the immunized animal. A variety of fusion techniques may be 
employed. For example, the spleen cells and myeloma cells may be combined with a 
nonionic detergent for a few minutes and then plated at low density on a selective medium 
that supports the growth of hybrid cells, but not myeloma cells. A preferred selection 
technique uses HAT (hypoxanthine, aminopterin, thymidine) selection. After a sufficient 
time, usually about 1 to 2 weeks, colonies of hybrids are observed. Single colonies are 
selected and tested for binding activity against the polypeptide. Hybridomas having high
reactivity and specificity are preferred. 

Monoclonal antibodies may be isolated from the supernatants of growing
hybridoma colonies. In addition, various techniques may be employed to enhance the yield,
such as injection of the hybridoma cell line into the peritoneal cavity of a suitable vertebrate
host, such as a mouse. Monoclonal antibodies may then be harvested from the ascites fluid or
the blood. Contaminants may be removed from the antibodies by conventional techniques,
such as chromatography, gel filtration, precipitation, and extraction. The polypeptides of this
invention may be used in the purification process in, for example, an affinity chromatography
step. 

Monoclonal antibodies of the present invention may also be used as 
therapeutic reagents, to diminish or eliminate breast tumors. The antibodies may be used on 
their own (for instance, to inhibit metastases) or coupled to one or more therapeutic agents. 
Suitable agents in this regard include radionuclides, differentiation inducers, drugs, toxins, 
and derivatives thereof. Preferred radionuclides include ⁹⁰Y, ¹²³I, ¹²⁵I, ¹³¹I, ¹⁸⁶Re, ¹⁸⁸Re, ²¹¹At,
and ²¹²Bi. Preferred drugs include methotrexate, and pyrimidine and purine analogs.
Preferred differentiation inducers include phorbol esters and butyric acid. Preferred toxins
include ricin, abrin, diphtheria toxin, cholera toxin, gelonin, Pseudomonas exotoxin, Shigella
toxin, and pokeweed antiviral protein.

A therapeutic agent may be coupled (e.g., covalently bonded) to a suitable
monoclonal antibody either directly or indirectly (e.g., via a linker group). A direct reaction 
between an agent and an antibody is possible when each possesses a substituent capable of 
reacting with the other. For example, a nucleophilic group, such as an amino or sulfhydryl 
group, on one may be capable of reacting with a carbonyl-containing group, such as an
anhydride or an acid halide, or with an alkyl group containing a good leaving group (e.g., a 
halide) on the other. 

Alternatively, it may be desirable to couple a therapeutic agent and an
antibody via a linker group. A linker group can function as a spacer to distance an antibody 
from an agent in order to avoid interference with binding capabilities. A linker group can 
also serve to increase the chemical reactivity of a substituent on an agent or an antibody, and 
thus increase the coupling efficiency. An increase in chemical reactivity may also facilitate 
the use of agents, or functional groups on agents, which otherwise would not be possible.

It will be evident to those skilled in the art that a variety of bifunctional or 
polyfunctional reagents, both homo- and hetero-functional (such as those described in the
catalog of the Pierce Chemical Co., Rockford, IL), may be employed as the linker group. 
Coupling may be effected, for example, through amino groups, carboxyl groups, sulfhydryl 
groups or oxidized carbohydrate residues. There are numerous references describing such 
methodology, e.g., U.S. Patent No. 4,671,958, to Rodwell et al.

Where a therapeutic agent is more potent when free from the antibody portion 
of the immunoconjugates of the present invention, it may be desirable to use a linker group 
which is cleavable during or upon internalization into a cell. A number of different cleavable 
linker groups have been described. The mechanisms for the intracellular release of an agent
from these linker groups include cleavage by reduction of a disulfide bond (e.g., U.S. Patent 
No. 4,489,710, to Spitler), by irradiation of a photolabile bond (e.g., U.S. Patent 
No. 4,625,014, to Senter et al.), by hydrolysis of derivatized amino acid side chains (e.g., U.S. 
Patent No. 4,638,045, to Kohn et al.), by serum complement-mediated hydrolysis (e.g., U.S. 
Patent No. 4,671,958, to Rodwell et al.), and acid-catalyzed hydrolysis (e.g., U.S. Patent
No. 4,569,789, to Blattler et al.).

It may be desirable to couple more than one agent to an antibody. In one
embodiment, multiple molecules of an agent are coupled to one antibody molecule. In
another embodiment, more than one type of agent may be coupled to one antibody.
Regardless of the particular embodiment, immunoconjugates with more than one agent may
be prepared in a variety of ways. For example, more than one agent may be coupled directly 
to an antibody molecule, or linkers which provide multiple sites for attachment can be used. 
Alternatively, a carrier can be used. 

A carrier may bear the agents in a variety of ways, including covalent bonding 
either directly or via a linker group. Suitable carriers include proteins such as albumins (e.g., 
U.S. Patent No. 4,507,234, to Kato et al.), peptides and polysaccharides such as aminodextran 
(e.g., U.S. Patent No. 4,699,784, to Shih et al.). A carrier may also bear an agent by 
noncovalent bonding or by encapsulation, such as within a liposome vesicle (e.g., U.S. Patent 
Nos. 4,429,008 and 4,873,088). Carriers specific for radionuclide agents include 
radiohalogenated small molecules and chelating compounds. For example, U.S. Patent No. 
4,735,792 discloses representative radiohalogenated small molecules and their synthesis. A 
radionuclide chelate may be formed from chelating compounds that include those containing 
nitrogen and sulfur atoms as the donor atoms for binding the metal, or metal oxide, 
radionuclide. For example, U.S. Patent No. 4,673,562, to Davison et al. discloses 
representative chelating compounds and their synthesis. 

A variety of routes of administration for the antibodies and immunoconjugates 
may be used. Typically, administration will be intravenous, intramuscular, subcutaneous or 
in the bed of a resected tumor. It will be evident that the precise dose of the 
antibody/immunoconjugate will vary depending upon the antibody used, the antigen density 
on the tumor, and the rate of clearance of the antibody. 

Diagnostic reagents of the present invention may also comprise polynucleotide 
sequences encoding one or more of the above polypeptides, or one or more portions thereof. 
For example, at least two oligonucleotide primers may be employed in a polymerase chain 
reaction (PCR) based assay to amplify breast tumor-specific cDNA derived from a biological 
sample, wherein at least one of the oligonucleotide primers is specific for a DNA molecule 
encoding a breast tumor protein of the present invention. The presence of the amplified 
cDNA is then detected using techniques well known in the art, such as gel electrophoresis. 
Similarly, oligonucleotide probes specific for a DNA molecule encoding a breast tumor 
protein of the present invention may be used in a hybridization assay to detect the presence of 
an inventive polypeptide in a biological sample. 

As used herein, the term "oligonucleotide primer/probe specific for a DNA 
molecule" means an oligonucleotide sequence that has at least about 60%, preferably at least 
about 75% and more preferably at least about 90%, identity to the DNA molecule in question. 
Oligonucleotide primers and/or probes which may be usefully employed in the inventive 
diagnostic methods preferably have at least about 10-40 nucleotides. In a preferred 
embodiment, the oligonucleotide primers comprise at least about 10 contiguous nucleotides 
of a DNA molecule having a partial sequence selected from SEQ ID NOS: 1-94. Preferably, 
oligonucleotide probes for use in the inventive diagnostic methods comprise at least about 15 
contiguous nucleotides of a DNA molecule having a partial sequence provided in SEQ 
ID NOS: 1-94. Techniques for both PCR based assays and hybridization assays are well 
known in the art (see, for example, Mullis et al., Ibid; Ehrlich, Ibid). Primers or probes may 
thus be used to detect breast tumor-specific sequences in biological samples, including blood, 
urine and/or breast tumor tissue. 
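
The two criteria in the definition above (percent identity to the target, and a minimum run of contiguous nucleotides taken from the target) can be illustrated with a minimal Python sketch. It assumes an ungapped comparison against same-length windows of the target, which is a simplification because the text does not state how identity is computed. The function names and the toy sequences are hypothetical and are not taken from the sequence listing.

```python
def percent_identity(oligo: str, target: str) -> float:
    """Best ungapped percent identity of `oligo` against any
    same-length window of `target` (simplifying assumption)."""
    oligo, target = oligo.lower(), target.lower()
    if not oligo or len(oligo) > len(target):
        return 0.0
    best = 0
    for start in range(len(target) - len(oligo) + 1):
        window = target[start:start + len(oligo)]
        matches = sum(a == b for a, b in zip(oligo, window))
        best = max(best, matches)
    return 100.0 * best / len(oligo)


def has_contiguous_match(oligo: str, target: str, min_len: int) -> bool:
    """True if some run of `min_len` contiguous nucleotides of `oligo`
    occurs exactly in `target`."""
    oligo, target = oligo.lower(), target.lower()
    return any(
        oligo[i:i + min_len] in target
        for i in range(len(oligo) - min_len + 1)
    )


# Hypothetical toy sequences for illustration only.
target = "acgtacgttagcctaggatccgatcgatcgtacg"
primer = "tagcctaggatccg"      # exact 14-mer from the target
mismatched = "tagcctaggaaccg"  # same 14-mer with one mismatch

if __name__ == "__main__":
    for name, oligo in (("primer", primer), ("mismatched", mismatched)):
        print(name,
              round(percent_identity(oligo, target), 1),
              has_contiguous_match(oligo, target, 10))
```

Under these assumptions, an oligonucleotide would be screened first against the 60%, 75% or 90% identity levels and then against the 10-nucleotide (primer) or 15-nucleotide (probe) contiguity requirement.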

The following Examples are offered by way of illustration and not by way of 
limitation. 

EXAMPLES 
Example 1 

ISOLATION AND CHARACTERIZATION OF BREAST 
TUMOR POLYPEPTIDES 

This Example describes the isolation of breast tumor polypeptides from a 
breast tumor cDNA library. 

A human breast tumor cDNA expression library was constructed from a pool 
of breast tumor poly A+ RNA from three patients using a Superscript Plasmid System for 
cDNA Synthesis and Plasmid Cloning kit (BRL Life Technologies, Gaithersburg, MD 20897) 
following the manufacturer's protocol. Specifically, breast tumor tissues were homogenized 
with polytron (Kinematica, Switzerland) and total RNA was extracted using Trizol reagent 
(BRL Life Technologies) as directed by the manufacturer. The poly A+ RNA was then 
purified using a Qiagen oligotex spin column mRNA purification kit (Qiagen, Santa Clarita, 
CA 91355) according to the manufacturer's protocol. First-strand cDNA was synthesized 
using the NotI/Oligo-dT18 primer. Double-stranded cDNA was synthesized, ligated with 
EcoRI/BstXI adaptors (Invitrogen, Carlsbad, CA) and digested with NotI. Following size 
fractionation with Chroma Spin-1000 columns (Clontech, Palo Alto, CA 94303), the cDNA 
was ligated into the EcoRI/NotI site of pcDNA3.1 (Invitrogen, Carlsbad, CA) and 
transformed into ElectroMax E. coli DH10B cells (BRL Life Technologies) by 
electroporation. 

Using the same procedure, a normal human breast cDNA expression library 
was prepared from a pool of four normal breast tissue specimens. The cDNA libraries were 
characterized by determining the number of independent colonies, the percentage of clones 
that carried insert, the average insert size and by sequence analysis. The breast tumor library 
contained 1.14 x 10' independent colonies, with more than 90% of clones having a visible 
insert and the average insert size being 936 base pairs. The normal breast cDNA library 
contained 6x10* independent colonies, with 83% of clones having inserts and the average 
insert size being 1015 base pairs. Sequencing analysis showed both libraries to contain good 
complex cDNA clones that were synthesized from mRNA, with minimal rRNA and 
mitochondrial DNA contamination. 

cDNA library subtraction was performed using the above breast tumor and 
normal breast cDNA libraries, as described by Hara et al. (Blood, 84:189-199, 1994) with 
some modifications. Specifically, a breast tumor-specific subtracted cDNA library was 
generated as follows. Normal breast cDNA library (70 µg) was digested with EcoRI, NotI, 
and SfuI, followed by a filling-in reaction with DNA polymerase Klenow fragment. After 
phenol-chloroform extraction and ethanol precipitation, the DNA was dissolved in 100 µl of 
H2O, heat-denatured and mixed with 100 µl (100 µg) of Photoprobe biotin (Vector 
Laboratories, Burlingame, CA). The resulting mixture was irradiated with a 270 W sunlamp 
on ice for 20 minutes. Additional Photoprobe biotin (50 µl) was added and the biotinylation 
reaction was repeated. After extraction with butanol five times, the DNA was ethanol- 
precipitated and dissolved in 23 µl H2O to form the driver DNA. 

To form the tracer DNA, 10 µg breast tumor cDNA library was digested with 
BamHI and XhoI, phenol-chloroform extracted and passed through Chroma Spin-400 columns 
(Clontech). Following ethanol precipitation, the tracer DNA was dissolved in 5 µl H2O. 
Tracer DNA was mixed with 15 µl driver DNA and 20 µl of 2 x hybridization buffer (1.5 M 
NaCl/10 mM EDTA/50 mM HEPES pH 7.5/0.2% sodium dodecyl sulfate), overlaid with 
mineral oil, and heat-denatured completely. The sample was immediately transferred into a 
68 °C water bath and incubated for 20 hours (long hybridization [LH]). The reaction mixture 
was then subjected to a streptavidin treatment followed by phenol/chloroform extraction. 
This process was repeated three more times. Subtracted DNA was precipitated, dissolved in 
12 µl H2O, mixed with 8 µl driver DNA and 20 µl of 2 x hybridization buffer, and subjected 
to a hybridization at 68 °C for 2 hours (short hybridization [SH]). After removal of 
biotinylated double-stranded DNA, subtracted cDNA was ligated into BamHI/XhoI site of 
chloramphenicol resistant pBCSK+ (Stratagene, La Jolla, CA 92037) and transformed into 
ElectroMax E. coli DH10B cells by electroporation to generate a breast tumor specific 
subtracted cDNA library. 
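
As a worked check of the long hybridization mix, the following Python sketch computes the final buffer concentrations after dilution. It assumes the volumes are read as 5 µl tracer DNA, 15 µl driver DNA and 20 µl of 2 x hybridization buffer (the microliter units are an interpretation of the text), and the function name is hypothetical.

```python
def final_concentrations(stock, stock_volume_ul, other_volumes_ul):
    """Scale each stock concentration by the fraction of the final
    volume that the stock solution occupies."""
    total_ul = stock_volume_ul + sum(other_volumes_ul)
    factor = stock_volume_ul / total_ul
    return {name: conc * factor for name, conc in stock.items()}


# Composition of the 2 x hybridization buffer as given in the text.
buffer_2x = {
    "NaCl (M)": 1.5,
    "EDTA (mM)": 10.0,
    "HEPES (mM)": 50.0,
    "SDS (%)": 0.2,
}

# Assumed volumes: 20 ul buffer + 5 ul tracer + 15 ul driver = 40 ul total.
long_hyb = final_concentrations(buffer_2x, 20.0, [5.0, 15.0])

if __name__ == "__main__":
    for name, conc in long_hyb.items():
        print(f"{name}: {conc:g}")
```

With these volumes the buffer makes up half of the 40 µl reaction, so every component ends up at half its stock concentration (1 x buffer).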

To analyze the subtracted cDNA library, plasmid DNA was prepared from 100 
independent clones, randomly picked from the subtracted breast tumor specific library and 
characterized by DNA sequencing with a Perkin Elmer/Applied Biosystems Division 
Automated Sequencer Model 373A (Foster City, CA). Thirty-eight distinct cDNA clones 
were found in the subtracted breast tumor-specific cDNA library. The determined 3' cDNA 
sequences for 14 of these clones are provided in SEQ ID NO: 1-14, with the corresponding 
5' cDNA sequences being provided in SEQ ID NO: 15-28, respectively. The determined one 
strand (5' or 3') cDNA sequences for the remaining clones are provided in SEQ ID NO: 29- 
52. Comparison of these cDNA sequences with known sequences in the gene bank using the 
EMBL and GenBank databases (Release 97) revealed no significant homologies to the 
sequences provided in SEQ ID NO: 3, 10, 17, 24 and 45-52. The sequences provided in SEQ 
ID NO: 1, 2, 4-9, 11-16, 18-23, 25-41, 43 and 44 were found to show at least some degree of 
homology to known human genes. The sequence of SEQ ID NO: 42 was found to show 
some homology to a known yeast gene. 

Data was analyzed using Synteni provided GEMTOOLS Software. Twenty-one 
distinct cDNA clones were found to be over-expressed in breast tumor and expressed at 
low levels in all normal tissues tested. The determined partial cDNA sequences for these 
clones are provided in SEQ ID NO: 53-73. Comparison of the sequences of SEQ ID NO: 
53, 54, and 68-71 with those in the gene bank as described above revealed some homology to 
previously identified human genes. No significant homologies were found to the sequences 
of SEQ ID NO: 55-67, 72 (referred to as JJ 9434,71 17), and 73 (referred to as B535S). 

In a further experiment, cDNA fragments analyzed by DNA microarray were 
obtained from two subtraction libraries derived by conventional subtraction, as described 
above. In one instance the tester was derived from primary breast tumors. In the second 
instance, a metastatic breast tumor was employed as the tester. Drivers consisted of normal 
breast. 

cDNA fragments from these two libraries were submitted as templates for 
DNA microarray analysis. DNA chips were analyzed by hybridizing with fluorescent probes 
derived from mRNA from both tumor and normal tissues. Analysis of the data was 
accomplished by creating three groups from the sets of probes. These probe groups are 
referred to as Breast Tumor/mets, Normal non-breast tissues, and Metastatic 
breast tumors. Two comparisons were performed using the modified Gemtools analysis. The 
first comparison was to identify templates with elevated expression in breast tumors. The 
second was to identify templates not recovered in the first comparison that yielded elevated 
expression in metastatic breast tumors. An arbitrary level of increased expression (mean of 
tumor expression versus the mean of normal tissue expression) was set at approximately 2.2. 
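
The threshold comparison described above can be sketched in Python. This is only an illustration: the template names and signal values are invented, and the helper functions are hypothetical rather than part of the Gemtools software, whose actual analysis is not described in the text. The sketch takes the ratio of mean tumor signal to mean normal-tissue signal and keeps templates at or above the cutoff of approximately 2.2.

```python
from statistics import mean


def fold_change(tumor_signals, normal_signals):
    """Ratio of mean tumor expression to mean normal-tissue expression."""
    return mean(tumor_signals) / mean(normal_signals)


def overexpressed(templates, threshold=2.2):
    """Return the sorted names of templates whose fold change meets the
    threshold. `templates` maps name -> (tumor signals, normal signals)."""
    return sorted(
        name
        for name, (tumor, normal) in templates.items()
        if mean(normal) > 0 and fold_change(tumor, normal) >= threshold
    )


# Invented signal intensities for illustration only.
example_templates = {
    "template_A": ([900, 1100, 1000], [300, 350, 250]),  # ratio about 3.3
    "template_B": ([400, 420], [380, 400]),              # ratio about 1.05
    "template_C": ([700, 700], [300, 300]),              # ratio about 2.3
}

if __name__ == "__main__":
    print(overexpressed(example_templates))
```

The second comparison in the text could reuse the same filter with the metastatic tumor probe group as the tumor signals, restricted to templates not already recovered in the first comparison.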

In the first round of comparison to identify overexpression in breast tumors, 
two novel gene sequences were identified, hereinafter referred to as B534S and B538S (SEQ 
ID NO: 89 and 90), and six sequences that showed some degree of homology to previously 
identified genes (SEQ ID NO: 74-79). Additionally, in a second comparison to identify 
elevated expression in metastatic breast tumors, five novel sequences were identified, 
hereinafter referred to as B535S (overexpressed in this analysis as well as what was described 
above), B542S, B543S, P501S, and B541S (SEQ ID NO: 73, and 91-94), as well as nine gene 
sequences that showed some homology to known genes (SEQ ID NO: 80-88). Clones B534S 
and B538S (SEQ ID NO: 89 and 90) were shown to be overexpressed in both breast tumors 
and metastatic breast tumors. 

Example 2 

GENERATION OF HUMAN CD8+ CYTOTOXIC T-CELLS THAT RECOGNIZE 
ANTIGEN PRESENTING CELLS EXPRESSING BREAST TUMOR ANTIGENS 

This Example illustrates the generation of T cells that recognize target cells 
expressing the antigen B511S, also known as 1016-F8 (SEQ ID NO: 56). Human CD8+ T 
cells were primed in vitro to the B511S gene product using dendritic cells infected with a 
recombinant vaccinia virus engineered to express B511S as follows (also see Yee et al., 
Journal of Immunology (1996) 157(9):4079-86). Dendritic cells (DC) were generated from 
peripheral blood derived monocytes by differentiation for 5 days in the presence of 50 ng/ml 
GMCSF and 30 µg/ml IL-4. DC were harvested, plated in wells of a 24-well plate at a 
density of 2 x 10' cells/well and infected for 12 hours with B511S expressing vaccinia at a 
multiplicity of infection of 5. DC were then matured overnight by the addition of 3 µg/ml 
CD40-Ligand and UV irradiated at 100 µW for 10 minutes. CD8+ T cells were isolated using 
magnetic beads, and priming cultures were initiated in individual wells (typically in 24 wells 
of a 24-well plate) using 7x10' CD8+ T cells and 1 x 10* irradiated CD8-depleted PBMC; 
IL-7 at 10 ng/ml was added to cultures at day 1. Cultures were re-stimulated every 7-10 days 
using autologous primary fibroblasts retrovirally transduced with B511S and the 
costimulatory molecule B7.1. Cultures were supplemented at day 1 with 15 I.U. of IL-2. 
Following 4 such stimulation cycles, CD8+ cultures were tested for their ability to 
specifically recognize autologous fibroblasts transduced with B511S using an interferon-γ 
Elispot assay (see Lalvani et al., J. Experimental Medicine (1997) 186:859-865). Briefly, T 
cells from individual microcultures were added to 96-well Elispot plates that contained 
autologous fibroblasts transduced to express either B511S or, as a negative control, the antigen 
EGFP, and incubated overnight at 37 °C; wells also contained IL-12 at 10 ng/ml. Cultures 
were identified that specifically produced interferon-γ only in response to B511S transduced 
fibroblasts; such lines were further expanded and also cloned by limiting dilution on 
autologous B-LCL retrovirally transduced with B511S. Lines and clones were identified that 
could specifically recognize autologous B-LCL transduced with B511S but not autologous B- 
LCL transduced with the control antigens EGFP or HLA-A3. An example demonstrating the 
ability of human CTL cell lines derived from such experiments to specifically recognize and 
lyse B511S expressing targets is presented in Figure 1. 

Example 3 

SYNTHESIS OF POLYPEPTIDES 

Polypeptides may be synthesized on a Perkin Elmer/Applied Biosystems 
Division 430A peptide synthesizer using FMOC chemistry with HBTU 
(O-Benzotriazole-N,N,N',N'-tetramethyluronium hexafluorophosphate) activation. A Gly-Cys-Gly sequence 
may be attached to the amino terminus of the peptide to provide a method of conjugation, 
binding to an immobilized surface, or labeling of the peptide. Cleavage of the peptides from 
the solid support may be carried out using the following cleavage mixture: trifluoroacetic 
acid:ethanedithiol:thioanisole:water:phenol (40:1:2:2:3). After cleaving for 2 hours, the 
peptides may be precipitated in cold methyl-t-butyl-ether. The peptide pellets may then be 
dissolved in water containing 0.1% trifluoroacetic acid (TFA) and lyophilized prior to 
purification by C18 reverse phase HPLC. A gradient of 0%-60% acetonitrile (containing 
0.1% TFA) in water (containing 0.1% TFA) may be used to elute the peptides. Following 
lyophilization of the pure fractions, the peptides may be characterized using electrospray or 
other types of mass spectrometry and by amino acid analysis. 

From the foregoing, it will be appreciated that, although specific embodiments 
of the invention have been described herein for the purposes of illustration, various 
modifications may be made without deviating from the spirit and scope of the invention. 

CLAIMS 

1. An isolated polypeptide comprising an immunogenic portion of a 
breast protein or a variant of said protein that differs only in conservative substitutions and/or 
modifications, wherein said protein comprises an amino acid sequence encoded by a 
polynucleotide molecule comprising a sequence selected from the group consisting of: (a) 
nucleotide sequences recited in SEQ ID NOS: 3, 10, 17, 24, 45-52, 55-67, 72, 73, and 89-94; 
(b) complements of said nucleotide sequences; and (c) sequences that hybridize to a sequence 
of (a) or (b) under moderately stringent conditions. 

2. An isolated polynucleotide molecule comprising a nucleotide sequence 
encoding the polypeptide of claim 1. 

3. An isolated polynucleotide molecule comprising a sequence provided 
in SEQ ID NOS: 3, 10, 17, 24, 45-52, 55-67, 72, 73, and 89-94. 

4. An expression vector comprising a polynucleotide molecule according 
to any one of claims 2 and 3. 

5. A host cell transformed with the expression vector of claim 4. 

6. The host cell of claim 5 wherein the host cell is selected from the group 
consisting of E. coli, yeast and mammalian cell lines. 

7. A pharmaceutical composition comprising the polypeptide of claim 1 
and a physiologically acceptable carrier. 

8. A vaccine comprising the polypeptide of claim 1 and a non-specific 
immune response enhancer. 

9. The vaccine of claim 8 wherein the non-specific immune response 
enhancer is an adjuvant. 

10. A vaccine comprising a polynucleotide molecule of any one of claims 
2 and 3 and a non-specific immune response enhancer. 

11. The vaccine of claim 10 wherein the non-specific immune response 
enhancer is an adjuvant. 

12. A pharmaceutical composition for the treatment of breast cancer 
comprising a polypeptide and a physiologically acceptable carrier, the polypeptide 
comprising an immunogenic portion of a breast protein, wherein said protein comprises an 
amino acid sequence encoded by a polynucleotide molecule comprising a sequence selected 
from the group consisting of: (a) nucleotide sequences recited in SEQ ID NOS: 1, 2, 4-9, 11- 
16, 18-23, 25-44, 53, 54, 68-71, and 74-88; (b) complements of said nucleotide sequences; 
and (c) sequences that hybridize to a sequence of (a) or (b) under moderately stringent 
conditions. 

13. A vaccine for the treatment of breast cancer comprising a polypeptide 
and a non-specific immune response enhancer, said polypeptide comprising an immunogenic 
portion of a breast protein, wherein said protein comprises an amino acid sequence encoded 
by a polynucleotide molecule comprising a sequence selected from the group consisting of: 
(a) nucleotide sequences recited in SEQ ID NOS: 1, 2, 4-9, 11-16, 18-23, 25-44, 53, 54, 68- 
71, and 74-88; (b) complements of said nucleotide sequences; and (c) sequences that 
hybridize to a sequence of (a) or (b) under moderately stringent conditions. 

14. The vaccine of claim 13 wherein the non-specific immune response 
enhancer is an adjuvant. 

15. A vaccine for the treatment of breast cancer comprising a 
polynucleotide molecule and a non-specific immune response enhancer, the polynucleotide 
molecule comprising a sequence selected from the group consisting of: (a) nucleotide 
sequences recited in SEQ ID NOS: 1, 2, 4-9, 11-16, 18-23, 25-44, 53, 54, 68-71, and 74-88; 
(b) complements of said nucleotide sequences; and (c) sequences that hybridize to a sequence 
of (a) or (b) under moderately stringent conditions. 

16. The vaccine of claim 15, wherein the non-specific immune response 
enhancer is an adjuvant. 

17. A pharmaceutical composition according to claims 7 or 12, for use in 
the manufacture of a medicament for inhibiting the development of breast cancer in a patient. 

18. A vaccine according to any one of claims 8, 10, 13 or 15, for use in the 
manufacture of a medicament for inhibiting the development of breast cancer in a patient. 

19. A fusion protein comprising at least one polypeptide according to 
claim 1. 

20. A pharmaceutical composition comprising a fusion protein according 
to claim 19 and a physiologically acceptable carrier. 

21. A vaccine comprising a fusion protein according to claim 19 and a 
non-specific immune response enhancer. 

22. The vaccine of claim 21 wherein the non-specific immune response 
enhancer is an adjuvant. 

23. A pharmaceutical composition according to claim 20, for use in 
manufacture of a medicament for inhibiting the development of breast cancer in a patient. 

24. A vaccine according to claim 21, for use in the manufacture of a 
medicament for inhibiting the development of breast cancer in a patient. 

25. A method for detecting breast cancer in a patient, comprising: 

(a) contacting a biological sample from a patient with a binding agent 
which is capable of binding to a polypeptide, the polypeptide comprising an immunogenic 
portion of a breast protein, wherein said protein comprises an amino acid sequence encoded 
by a polynucleotide molecule comprising a sequence selected from the group consisting of 
nucleotide sequences recited in SEQ ID NOS: 1-94, complements of said nucleotide 
sequences and sequences that hybridize to a sequence provided in SEQ ID NO: 1-94 under 
moderately stringent conditions; and 

(b) detecting in the sample a protein or polypeptide that binds to the 
binding agent, thereby detecting breast cancer in the patient. 

26. The method of claim 25 wherein the binding agent is a 
monoclonal antibody. 

27. The method of claim 26 wherein the binding agent is a 
polyclonal antibody. 

28. A method for monitoring the progression of breast cancer in a patient, 
comprising: 

(a) contacting a biological sample from a patient with a binding agent that 
is capable of binding to a polypeptide, said polypeptide comprising an immunogenic portion 
of a breast protein, wherein said protein comprises an amino acid sequence encoded by a 
polynucleotide molecule comprising a sequence selected from the group consisting of 
nucleotide sequences recited in SEQ ID NOS: 1-94, complements of said nucleotide 
sequences and sequences that hybridize to a sequence provided in SEQ ID NO: 1-94 under 
moderately stringent conditions; 

(b) determining in the sample an amount of a protein or polypeptide that 
binds to the binding agent; 

(c) repeating steps (a) and (b); and 

(d) comparing the amount of polypeptide detected in steps (b) and (c) to 
monitor the progression of breast cancer in the patient. 

29. A monoclonal antibody that binds to a polypeptide comprising an 
immunogenic portion of a breast protein or a variant of said protein that differs only in 
conservative substitutions and/or modifications, wherein said protein comprises an amino 
acid sequence encoded by a polynucleotide molecule comprising a sequence selected from 
the group consisting of: (a) nucleotide sequences recited in SEQ ID NOS: 3, 10, 17, 24, 45- 
52, 55-67, 72, 73, and 89-94; (b) complements of said nucleotide sequences; and (c) 
sequences that hybridize to a sequence of (a) or (b) under moderately stringent conditions. 

30. A monoclonal antibody according to claim 29, for use in the 
manufacture of a medicament for inhibiting the development of breast cancer in a patient. 

31. The monoclonal antibody of claim 30 wherein the monoclonal 
antibody is conjugated to a therapeutic agent. 

32. A method for detecting breast cancer in a patient comprising: 

(a) contacting a biological sample from a patient with at least two 
oligonucleotide primers in a polymerase chain reaction, wherein at least one of the 
oligonucleotides is specific for a polynucleotide molecule encoding a polypeptide comprising 
an immunogenic portion of a breast protein, said protein comprising an amino acid sequence 
encoded by a polynucleotide molecule comprising a sequence selected from the group 
consisting of nucleotide sequences recited in SEQ ID NO: 1-94, complements of said 
nucleotide sequences and sequences that hybridize to a sequence of SEQ ID NO: 1-94 under 
moderately stringent conditions; and 

(b) detecting in the sample a polynucleotide sequence that amplifies in the 
presence of the oligonucleotide primers, thereby detecting breast cancer. 

33. The method of claim 32, wherein at least one of the oligonucleotide 
primers comprises at least about 10 contiguous nucleotides of a polynucleotide molecule 
comprising a sequence selected from SEQ ID NOS: 1-94. 

34. A diagnostic kit comprising: 

(a) one or more monoclonal antibodies of claim 29; and 

(b) a detection reagent. 

35. A diagnostic kit comprising: 

(a) one or more monoclonal antibodies that bind to a polypeptide encoded 
by a polynucleotide molecule comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NOS: 1, 2, 4-9, 11-16, 18-23, 25-44, 53, 54, 68-71, and 74-88, 
complements of said sequences and sequences that hybridize to a sequence of SEQ ID NO: 1, 
2, 4-9, 11-16, 18-23, 25-44, 53, 54, 68-71, or 74-88 under moderately stringent conditions; 
and 

(b) a detection reagent. 

36. The kit of claims 34 or 35 wherein the monoclonal antibodies are 
immobilized on a solid support. 

37. The kit of claim 36 wherein the solid support comprises nitrocellulose, 
latex or a plastic material. 

38. The kit of claims 34 or 35 wherein the detection reagent comprises a 
reporter group conjugated to a binding agent. 

39. The kit of claim 38 wherein the binding agent is selected from the 
group consisting of anti-immunoglobulins, Protein G, Protein A and lectins. 

40. The kit of claim 38 wherein the reporter group is selected from the 
group consisting of radioisotopes, fluorescent groups, luminescent groups, enzymes, biotin 
and dye particles. 

41. A diagnostic kit comprising at least two oligonucleotide primers, at 
least one of the oligonucleotide primers being specific for a polynucleotide molecule 
encoding a polypeptide comprising an immunogenic portion of a breast protein, said protein 
comprising an amino acid sequence encoded by a polynucleotide molecule comprising a 
sequence selected from the group consisting of nucleotide sequences recited in SEQ ID NOS: 
1-94, complements of said nucleotide sequences and sequences that hybridize to a sequence 
of SEQ ID NO: 1-94 under moderately stringent conditions. 

42. A diagnostic kit of claim 41 wherein at least one of the oligonucleotide 
primers comprises at least about 10 contiguous nucleotides of a polynucleotide molecule 
comprising a sequence selected from SEQ ID NOS: 1-94. 

43. A method for detecting breast cancer in a patient, comprising: 

(a) obtaining a biological sample from the patient; 

(b) contacting the biological sample with an oligonucleotide probe specific 
for a polynucleotide molecule encoding a polypeptide comprising an immunogenic portion of 
a breast protein, said protein comprising an amino acid sequence encoded by a polynucleotide 
molecule comprising a sequence selected from the group consisting of nucleotide sequences 
recited in SEQ ID NOS: 1-94, complements of said nucleotide sequences and sequences that 
hybridize to a sequence of SEQ ID NO: 1-94 under moderately stringent conditions; and 

(c) detecting in the sample a polynucleotide sequence that hybridizes to 
the oligonucleotide probe, thereby detecting breast cancer in the patient. 

44. The method of claim 43 wherein the oligonucleotide probe comprises 
at least about 15 contiguous nucleotides of a polynucleotide molecule comprising a sequence 
selected from the group consisting of SEQ ID NOS: 1-94. 

45. A diagnostic kit comprising an oligonucleotide probe specific for a 
polynucleotide molecule encoding a polypeptide comprising an immunogenic portion of a 
breast protein, said protein comprising an amino acid sequence encoded by a polynucleotide 
molecule comprising a sequence selected from the group consisting of nucleotide sequences 
recited in SEQ ID NOS: 1-94, complements of said nucleotide sequences, and sequences that 
hybridize to a sequence of SEQ ID NO: 1-94 under moderately stringent conditions. 

46. The diagnostic kit of claim 45, wherein the oligonucleotide probe 
comprises at least about 15 contiguous nucleotides of a polynucleotide molecule comprising a 
sequence selected from the group consisting of SEQ ID NOS: 1-94. 

47. Peripheral blood cells from a patient incubated in the presence of at 
least one polypeptide of claim 1, such that T cells proliferate, for use in the manufacture of a 
medicament for treating breast cancer in a patient. 

48. The blood cells of claim 47 wherein the T cells is repeated one or 
more times. 

49. A composition for the treatment of breast cancer in a patient, 
comprising T cells proliferated in the presence of a polypeptide of claim 1, in combination 
with a pharmaceutically acceptable carrier. 

50. Antigen presenting cells incubated in the presence of at least one 
polypeptide of claim 1, for use in the manufacture of a medicament for treating breast cancer 
in a patient. 

51. The cells of claim 50 wherein the antigen presenting cells are selected 
from the group consisting of dendritic and macrophage cells. 

52. A composition for the treatment of breast cancer in a patient, 
comprising antigen presenting cells incubated in the presence of a polypeptide of claim 1, in 
combination with a pharmaceutically acceptable carrier. 

SEQUENCE LISTING 

<110> Corixa Corporation 

<120> COMPOUNDS FOR IMMUNOTHERAPY AND DIAGNOSIS OF BREAST CANCER AND 
METHODS FOR THEIR USE 

<130> 210121.446PC 

<140> PCT 
<141> 1998-12-22 

<160> 94 

<170> PatentIn Ver. 2.0 

<210> 1 
<211> 402 
<212> DNA 
<213> Homo sapiens 

<400> 1 

tttttttttt tttttaggag aactgaatca aacagatttt attcaacttc ttagatgagg 60 
aaaacaaatn atacgaaacn ngtcataaga aatgctttcc tataccacta tctcaaacca 120 
ctttcaatat tttacaaaat gctcacgcag caaatatgaa aagctncaac acttcccttt 180 
gttaacttgc tgcaatnaat gcaactttaa canacataca aatttcttct gtatcttaaa 240 
agttnaatta ccaatcttaa tgatntcncc caagatnttt attcatatac ttttaatgac 300 
tcnttgccna tacatacnta ttttctttac ttttttttta cnatnggcca acagctttca 360 
ngcagnccnc aaaaatctta ccggttaatt acacggggtt gt 402 

<210> 2 
<211> 424 
<212> DNA 
<213> Homo sapiens 

<400> 2 

tttttttttt ttttttaaag gtacacattt ctttttcatt ctgtttnatg cagcaaataa 60 
ttcgttggca tcttctctgt gatgggcagc ttgctaaaat tanactcagg ccccttagct 120 
ncatttccaa ctnagcccac gctttcaacc nngccnaaca aagaaaatca gttngggtta 180 
aattctttgc tgganacaaa gaactacatt cctttgtaaa tnatgctttg tttgctctgt 240 
gcaaacncag attgaaggga anaagganac ttntggggac ggaaacaact ngnagaagca 300 
gganccgccc agggncattt cctcaccatg cttaatcttg cnctcacttg cngggcacca 360 
ttaaacttgg tgcaaaaggc gcaattggtg nanggaaccc cacaccttcc ttaaaaagca 420 
gggc 424 

<210> 3 
<211> 421 
<212> DNA 
<213> Homo sapiens 

<400> 3 

tttttttttt tttttcccaa tttaaaaaag cctttttcat acttcaatta caccanactt 60 
aatnatttca tgagtaaatc ngacattact atttnaaaat ttgcatattt aaaatttgna 120 
tcanttactt ccagactgtt tgcanaatga agggaggatc actcaagngc tgatctcnca 180 
ctntctgcag tctnctgtcc tgtgcccggn ctaatggatc gacactahat ggacagntcn 240 
cagatcttcc gttcttntcc cttccccaat ttcncaccnc tccccttctt ncccggatcn 300 
tttggggaca tgntaatttc gcnatcccta aaccctgccc gccangggtc ccnanctcag 360 
gggtggttaa tgttcgncng gcttnttgac cncctgcgcc ctttnantcc naaccccaag 420 
c 421 

<210> 4 
<211> 423 
<212> DNA
<213> Homo sapiens

<400> 4

tttttttatt tttttttcta tttntnntat ttnntgnggt tcctgtgtgt aattagnang 60
tgtgtatgcg tangtacnta tgtntgcata tttaacctgt tncctttcca tttttaaaat 120
aaaatctcaa natngtantt ggttnatggg agtaaanaga gactatngat naattttaac 180
atggacacng tgaaatgtag ccgccnatca ntttaaaact tcactttgaa ggccttttnc 240
cctccnaata aaaatnccng gccctactgg gttaagcaac attgcatntc taaagaaacc 300
acatgcanac nagttaaacc tgtgnactgg tcangcaaac cnanntggaa nanaagggnn 360
ttcnccccan ggacantcng aattttttta acaaattacn atnccccccc ngggggagcc 420
tgt 423



<210> 5 

<211> 355 

<212> DNA 

<213> Homo sapiens

<400> 5

acgaccacct natttcgtat ctttcaactc ttttcgaccg gacctcttat tcggaagcgt 60
tccaggaaga caggtctcaa cttagggatc agatcacgtt atcaacgctc tgggatcgct 120
gcaacccggc acctcaagga agtgcaccga tnacgtctag accggccaac acagatctag 180
aggtggccaa ctgatcactg taggagctga ctggcaanan tcaaccgggc cccaaccnag 240
agtgaccaan acnaccattn aggatcaccc acaggcactc ctcgtcctag ggccaaccna 300
ccaaacggct ggccaatggg ggggtttaat atttggtcna aaaattgatt tcaaa 355



<210> 6 

<211> 423
<212> DNA
<213> Homo sapiens

<400> 6

tttttttttt tttttggaca ggaagtaaaa tttattggtn antattaana ggggggcagc 60
acattggaag ccctcatgan tgcagggccc gccactcgtc cagagggcca cnatcgggga 120
tgcacttaac cccacagccn tctgggatna gccgcttctc agccaccatn tcttcaaatt 180
catcagcatt aaacttggta aanccccact tctttaagat ntghatcttc tggcggccag 240
naaacttgaa cttggccctg cgcagggcct caatcacatg ctccttgttc tgcagcttgg 300
tgcgnaagga cntaatnact tggccnatgt gaaccctggc cacantgccc tggggctttc 360
caaaggcacc tegcaagccc ntctggancc tgnccgcccc ngcacaggga caacatcttg 420
ttt 423

<210> 7
<211> 410
<212> DNA
<213> Homo sapiens

<400> 7

ttcgcactgg ctaaaacaaa ccgccttgca aagttngaaa aatttatcaa tggaccaaat 60
aatgctcata tcchacaagt tggtgaccgt tntcatnata aaaaaatgta tnatgctcct 120
nanctgttgt acaataatgt tccaattthg gacnttcggc atctaccctg gttcacctgg 180
gtaaatatca ggcagctttt gatggggcta ggaaagccaa cagtactcga acatgggaaa 240
gaggtctgct tcgccngtgt anatgggaaa naattccgtc ttgctcngat tcgtggactt 300
catattgttg tacatgcaga tgaatnngaa gaactcgcca actactatca ggatcgtggc 360
tttttnnaaa agctnatcac catgttggaa gcggcactng gacttgagcg 410

<210> 8 

<211> 274 

<212> DNA 

<213> Homo sapiens 

<400> 8 

tttttttttt tttttaggtc atacatattt tttattataa canatatntg tatatacata 60
taatatatgt gtatiatatcc acgtgtgtgt gtgtgtatca aaaacaacan aantttagtg 120
atctatatct ntiigctcaca tatgcatggg agataccagt aaaaaataag tnaatctcca 180
taatatgttt taaaactcan anaaatcnga gagactnaaa gaaaacgttn atcannatga 240
ccgtngataa tcttgaanaa thacnaaaac atat 274

<210> 9 

<211> 322 

<212> DNA 

<213> Homo sapiens 

<400> 9 

tttttttttt ttttgtgcct tattgcaccg gcnanaactt ctagcactat atnaaactca 60 

ataagagtga taagtgtgaa aacccttgcc ttctctttaa tcttaatgna liaggcatctg 120 

gtttttcacc attaantgta ataatggctn tatgtatttt tatnnatggt cttnatggag 180 

ttaaaaaagt tttcctctnt ccctngttat ctaanagttt tnatcaaaaa tgggtataat 240 

atttngttca gtacttttnc ctgcacctat agatacgatn ctgttatttt ttcttcttng 300 

cccnnanata tgatggatha ca 322 

<210> 10 

<211> 425 

<212> DNA 

<213> Homo sapiens 

<400> 10 

tttttttttt tttttattct gcagccatta aatgctgaac actagacnct tatttgtgga 60 

ggtcacaaaa taagtacaga atatnacaca cgccctgccc ataaaaagca cagctcccag 120 

ttctatattt acaatatctc tggaattcca ccttcccttc taatttgacc aatatttctg 180 

cttctcaggc agcagcgcct tctggcaacc ataagaacca acncgnggac taggtcggtg 240 

ggccaaggat caggaaacag aianaatggaa gnagcccccn tgacnctatt aanctntnaa 300

actatctnaa ctgctagttc tcaggcttta aatcatgtaa natacgtgtc cttnttgctg 360

caaccggaag catcctagat ggtacactct ctccaggtgc caggaaaaga tcccaaatng 420
caggn 425

<210> 11 

<211> 424 

<212> DNA 

<213> Homo sapiens 

<400> 11 

ttttnttant ttttttancc nctnntccnn tntgttgnag ggggtaccaa atttctttat 60 
ttaaaggaat ggtacaaatc aaaaaactta atttaatttt tnggtacaac ttatagaaaa 120 
ggttaaggaa accccaacat gcatgcactg ccttggtaac cagggnattc ccccncggct 180
ncggggaaat tagcccaang ctnagctttc attatcaetn tcccccaggg tntgcttttc 240
aaaaaaattt nccgccnagc cnaatccggg cnctcccatc tggcgcaant tggtcacttg 300
gtcccccnat tctttaangg cttncacctn ctcattcggg tnatgtgtct caactaaatc 360
ccacngatgg gggtcatttt tntcnnttag ccagtttgtg nagttccgtt attganaaaa 420 
ccan 424 



<210> 12 

<211> 426 

<212> DNA 

<213> Homo sapiens 

<400> 12

tttttttttt ttttncttaa aagcttttat ctcctgctta cattacccat ctgttcttgc 60
atgttgtctg ctttttccac tagagccctt aacaactcaa tcatggttat ttcaagggct 120
ctaataattc cnaaactggc atcataaata agtctcgttc tnatgcttgt tttctctcta 180
tcacactgtg ttngttgctt tttnacatgc tttgtaattt ttggctgaaa gctgaaaaat 240
nacatacctg gttntacaac ctgaggtaan cagccttnta gtgtgaggtt ttatatiitta 300
ctggctaaga gctnggcnct gttnantant tgttgtanct ntatatgcca naggctttna 360
tttccnctng tgtccttgct tnagtacccc attnttttag gggttcccta naaactctat 420
426



<210> 13 
<211> 419 
<212> DNA
<213> Homo sapiens

<400> 13

tttttttttt tttttnagat agactctcac tctttcgccc aggctggagt gcagtggcgc 60
aatcaaggct cactgcaacc tctgccttat aaagcatttn ctaaaggtac aagctaaatt 120
ttaaaaatat ctctncacaa ccaatgcata acaaaaatta gttctacctc ataaacncnt 180
ggctcagccc tcgnaacaca tttccctgtt ctcaactgat gaacactcca naaacagaac 240
anatntaagc ttttccaggc ccagaaaagc tcgcgagggg atttgctntg tgtgtgacac 300
acctgccacc ctgtggcagc acagctccac acntgctttg ggccgcattt gcaagttctc 360
tgtaancccc ctgnaagacc cggatcagct gggtngaaat tgcangcnct cttttggca 419



<210> 14 

<211> 400
<212> DNA
<213> Homo sapiens

<400> 14

aanccattgc caagggtatc cggaggattg tggctgtcac aggtnccgag gcccanaagg 60
ccctcaggaa agcaaagagc ttgaaaaatg tctctctgtc atggaagccn aagtgaaggc 120
tcanactgct ccaacaagga tntgcanagg gagatcgcta accttggaga ggccctggcc 180
actgcagtcn tcccccantg gcagaaggat gaattgcggg agactctcan atcccttang 240
gaaggtcgtg gatnacttgg accgagcctc nnaagccaat ntccagaaca agtgttggag 300
aagacaaagc anttcatcga cgccaacccc naccggcctc tnttctcctg ganattgana 360
gcggcgcccc cgcccagggc cttaataanc cntgaagctn 400

<210> 15
<211> 395
<212> DNA
<213> Homo sapiens



WO 99/33869 PCT/US98/27416



<400> 15 

tgctttgctg cgtccaggaa gattagatng aanaatacat attgatttgc caaatgaaca 60 

agcgagatta gacntactga anatccatgc aggtcccatt acaaagcatg gtgaaacaga 120 

tgatgaagca attgtgaagc tatcggatgg ctttnatgga gcagatctga gaaatgtttg 180 

tactgaagca ggtatgttcg caactcgtgc tgatcatgat tttgtagtac aggaagactt 240 

catgaaagcn gtcagaanag tggctnattc tnaaagccgg agtctaaatt ggacnacnac 300 

ctntgtattt actgttggan ctttgatgct gcatgacaga ttttgcttan tgtaaaaatn 360 

aagttcaaga aaattatgtt agttttggcc attat 395 

<210> 16 
<211> 404 
<212> DNA 
<213> Homo sapiens 

<400> 16

ccaccactaa aatcctggct gagccctacn agtacctgtg cccctccccc aggacgagat 60
nagggcacac cctttaagtn aggtgacagg tcacctttaa gtgaggacag tcagctnaat 120
ttcacctctt gggcttgagt acctggttct cgtgccctga ggcgacnctn agccctgcag 180
ctnccatgta cgtgctgcca atngtcttga tcttctccao gccnctnaac ttgggcttca 240
gtaggagccg caggcnagaa ngaagcggcc aacagcgcca ccccatagcc gcagccnggc 300
tgcccctgct tctcaaggag gggtgtgggg ttcctccacc atcgccgccc ttgcaaacac 360
ntctcanggc ttccctnccg gctnancgca ngacttaagc atgg 404

<210> 17
<211> 360
<212> DNA
<213> Homo sapiens

<400> 17

ggccagaagc tttccacaaa ccagtgaagg tggcagcaaa gaaagcctct tagacnagga 60
gctggcagca gctgctatcc ngatngacng cagaaaccaa ccaccaattc agcaaacaca 120
acctcatacc tnaccgcttc cctttnaatg gccttcggtg tgtgcgcaca tgggcacgtg 180
cggggagaac catacttatt cccctnttcc cggcctacca cctccnctcc cccttctctt 240
ctctncaatt actntctccn ctgctttntt ctnancacta ctgctngtnt cnanagccng 300
cccgcaatta cctggcaaaa ctcgcgaccc ttcgggcagc gctaaanaat gcacatttac 360

<210> 18
<211> 316
<212> DNA
<213> Homo sapiens

<400> 18

atacatatac acatatatga ttttagatag agccatatac ctngaagtag tanatttgtt 60
tgtgtgtata tgtatgtgtc tactcatttt aaacaaacct gtgatagaga tgtaaccntg 120
agccagtttt tcatttgctt aaatnactca ccaagtaact aattaagttn tctttactct 180
taatgttnag tagtgagatt ctgttgaagg tgatattaaa aaccattcta tattaattaa 240
cattcatgtt gttttttaaa agctcatttg aaatcnaatt atgattactt ttcataccag 300
tcgatrittat gtangt 316

<210> 19
<211> 350
<212> DNA
<213> Homo sapiens



<400> 19 
aagggatgca nataatgctg tgtatgagct tgatggaaaa gaactctgta gtgaaagggt 60
tactattgaa catgctnggg ctcggtcacg aggtggaaga ggcagaggac gacactctga 120
ccgttttagt agtcgcagac cccgaaacga tagacgaaat gctccacctg taagaacaga 180
anatcgtctt atagttgaga atttatcctc aagagtcagc tggcaggctc gtcganatac 240
agttttgagt tnttttgatg tggcttttta aaaaagttat gggttactna tgttacattg 300
ttttattaaa agtagttttn aattaacgga tntgatggaa ttgttgtttt 350

<210> 20
<211> 367
<212> DNA
<213> Homo sapiens



<400> 20 

gntnnncnca agatcctnct ntcccccngg gcngccccnc cnccngtnat naccggtttn 60
ntaanatcnn gccgcncccg aagtctcnct nntgccgaga tgncccttat ncncnnatgn 120
ncaattntga cctnnggcga anaatggcng nngtgtatca gtntccnctc tgnggnctct 180
tagnatctga ccactangac ccnctatcct ctcaaaccct gtanncngcc ctaatttgtg 240
ccaattagtg catgntanag cntcctggcc cagatggcnt ccatatcctg gtncggcttc 300
cgcccctacc angncatccn catctactag agcttatccg ctncntgngg cgcaccggnt 360
ccccnct 367



<210> 21
<211> 366
<212> DNA
<213> Homo sapiens

<400> 21

cccaacacaa tggtctaagt anaactgtat tgctctgtag tatagttcca cattggcaac 60
ctacaatggg aaaatccata cataagtcag ttacttcctn atgagctttc tccttctgaa 120
tcctttatct tctgaagaaa gtacacacct tggtnatgat atctttgaat tgcccttctt 180
tccaggcatc agttggatga ttcatcatgg taattatggc attatcatat tcttcatact 240 
tgtcatacga aaacaccagt tctgcccnna gatgagcttg ttctgcagct cttagcacct 300 
tgggaatatt cactctagac cagaaacagc tcccggtgct ccctcatttt ctgaggctta 360 
aactcn 366 

<210> 22 
<211> 315 
<212> DNA 
<213> Homo sapiens

<400> 22

acttaatgca atctctggag gataatttgg atcaagaaat aaagaanaaa tgaattagga 60
gaagaaatna ctgggtnata tttcaatatt ttagaacttt aanaatgttg actatgattt 120
caatatattt gtnaaaactg agatacangt ttgacccata tctgcatttc gataattaaa 180
cnaatnnatt ctatttnaaf gttgtttcag agtcacagca cagactgaaa ctttttttga 240
atacctnaat atcacacttn tncttnnaat gatgttgaag acaatgatga catgccttna 300
gcatataatg tcgac 315



<210> 23 

<211> 202 

<212> DNA 

<213> Homo sapiens 



<400> 23 

actaatccag tgtggtgnaa ttccattgtg ttgggcaacc caggatatta aacttatnat 60 
ctaaaaattc ccaagagaaa naaactccag gccctgattg tttcactggg gaattttacc 120
aaatgttnca nnaaganatg acgctgattc tgtnaaatct ttttcagaag atagaggaga 180
acacccaccg nttcattttai tg 202



<210> 24 
<211> 365 
<212> DNA 
<213> Homo sapiens

<400> 24

ggatttcttg cccttttctc cctttttaag tatcaatgta tgaaatccac ctgtaccacc 60
ctttctgcca tacaaccgct accacatctg gctcctagaa cctgttttgc tttcatagat 120
ggatctcgga accnagtgtt nacttcattt ttaaacccca ttttagcaga tngtttgctn 180
tggtctgtct gtatteacca tggggcctgt acacaccacg tgtggttata gtcaaacaca 240
gtgccctcca ttgtggccac atgggagacc catnacccna taictgcatcc tgggctgatn 300
acggcactgq atctnacccg acntgggatt gaacccgggg tgggcagcng aattgaacag 360
gatca 365

<210> 25
<211> 359
<212> DNA
<213> Homo sapiens



<400> 25 

gtttcctgct tcaacagtgc ttggacggaa cccggcgctc gttccccacc ccggccggcc 60
gcccatagcc agccctccgt cacctcttca ccgcaccctc ggactgcccc aaggcccccg 120
cogccnctcc ngcgccncgc agccaccgcc gccnccncca cctctccttn gtcccgccnt 180
nacaacgcgt ccacctcgca ngttcgccng aactaccacc nggactcata ngccgccccc 240
aaccgcccga tcaacctgga gccctncccc ccgacnttaa cctttccntg tcttacttac 300
nttaaccgcc gnttattttg cttnaaaaga acttttcccc aatactttct ttcaccnnt 359

<210> 26
<211> 400
<212> DNA
<213> Homo sapiens



<400> 26 

agtgaaacag tatatgtgaa aaggagtttg tgannagcta cacaaaaata ttagatatct 60
ttataatttc caataggata ctcatcagtt ttgaataana gacatattct agagaaacca 120
ggtttctggt ttcagatttg aactctcaag agcttggaag ttatcacccc caccctcacg 180
acnacnaana aatctnaacn aacngaanac caatgacttt tcttagatct gtcaaagaac 240
ttcagccacg aggaaaacta tcnccctnaa tactggggac tggaaagaga gggtacagag 300
aatcacagtg aatcatagcc caagatcagc ttgcccggag ctnaagccng tacgatnatt 360
acttacaggg accacttcac agtnngtnga tnaantgccn 400

<210> 27
<211> 366
<212> DNA
<213> Homo sapiens



<400> 27 

gaatttctta gaaactgaag tttactctgc tccaagatat atctccactg tcttaatcaa 60
agggcgctng aatcatagca aatattctca tctttcaact aactttaagt agttntcctg 120
gaattttaca ttttccagaa aacactcctt tctgtatctg cgaaagaaag tgtgcctcag 180
gctgtagact gggctgcact ggacacctgc gggggactct ggctnagtgn ggacatggtc 240
agtattgatt ttcctcanac tcagcctgtg tagctntgaa agcatggaac agattacact 300
gcagttnacg tcatcccaca catcttggac tccnagaccc ggggaggcca catagtccgt 360
tacgna 366

<210> 28 

<211> 402 

<212> DNA 

<213> Homo sapiens 



<400> 28 

agtgggagcc tcctccttcc ccactcagtt ctttacatcc ccgaggcgca gctgggcnaa 60
ggaagtggcc agctgcagcg cctcctgcag gcagccaacg ttcttgcctg cggcccgtgc 120
agacacatcc ttgccaccac ctttaccgtc catcangcct gacacctgct gcacccactc 180
gctngctttt aagccccgat nggctgcatt ctgggggact tgacacaggc ncgtgatctt 240
gccagcctca ttgtccaccg cgaagagcat ggcaaaaagc ccgaggggag tgcatcttga 300
anagcttcaa ggcttcattc agggccttng ctnaggcgcc nctctccatc tccnggaata 360
acnagaggct ggtnngggtn actntcaata aactgcttcg tc 402



<210> 29 

<211> 175
<212> DNA
<213> Homo sapiens

<400> 29

cggacgggca tgaccggtcc ggtcagctgg gtggccagtt tcagttcttc agcagaactg 60
tctcccttct tgggggccga gggcttcctg gggaagagga tgagtttgga gcggtactcc 120
ttcagccgct gcacgttggt ctgcagggac tccgtggact tgttccgcct cctcg 175 

<210> 30 

<211> 360 

<212> DNA 

<213> Homo sapiens 



<400> 30 

ttgtattcct tatgatctct gatgggctct tcccgaaaat gccaagcgga agactttgtg 60
gcatgctcca gatttaaatc cagctgaggc tccctttgtt ttcagttcca tgtaacaatc 120
tggaaggaaa cttcacggac aggaagactg ctggagaaga gaagcgtgtt agcccatttg 180
aggcctgggg aatcacgtaa agggtaccca gacctcactt ttagttattt acatcaacga 240
gttctttcag ggaaccaaac ccagaattcg gtgcaaaagc caaacatctt ggtgggattt 300
gataaatgcc ttgggacctg gagtgctggg ctcgtgcaca ggaagagcac cagccgctga 360

<210> 31
<211> 380
<212> DNA
<213> Homo sapiens



<400> 31 

acgctctaag cctgtccacg agctcaatag ggaagcctgt gatgaccaca gactttgcga 60
acgctacgcc atggtttatg gatacaatgc tgcctaraan cgctacttca ggaagcgccg 120
agggaccnaa tgagactgag ggaagaaaaa aaacctcttt ttttctggag gctggcacct 180
gattttgtat ccccctgtnn cagcattncn gaaatacata ggcttatata caatgcttct 240
ttcctgtata ttctcttgtc tggctgcacc ccttnttccc gcccccagat tgataagtaa 300
tgaaagtgca ctgcagtnag ggtcaangga gacccancat atgtgattgc tccntnataa 360
acttctggtg tgatactttc 380

<210> 32

<211> 440 

<212> DNA 

<213> Homo sapiens 

<400> 32 

gtgtatggga gcccctgact cctcacgtgc ctgatccgtg cccttggtcc caggtcaggc 60
ccaccccctg cacctccacc tgccccagcc cctgcctctg ccccaagtgg ggccagctgc 120
cctcacttct ggggtggatg atgtgacctt cctnggggga ctgcggaagg gacaagggtt 180
ccctgaagtc ttacggtcca acatcaggac caagtcccat ggacatgctg acagggtccc 240
caggggagaic cgcnccanca gggacgtgtg cctggctgtg tacgcgggtg tgcagtgcac 300
gtganaagca cgtggcggct tctgggggcc atgtttgggg aaggaagtgt gcccnccacc 360
cctggagaac ctcagtcccn gtagccccct gccctggcac agcngcatnc acttcaaggg 420
cacccttcgg gggttggggt 440

<210> 33
<211> 345
<212> DNA
<213> Homo sapiens



<400> 33 

tattttaaca atgtttatta ttcatttatc cctctataga accaccaccc acaccgagga 60 

gattatttgg agtgggtccc aacctagggc ctggactctg aaatctaact ccccacttcc 120 

ctcattttgt gacttaggtg ggggcatggt tcagtcagaa ctggtgtctc ctatcggatc 180 

gtgcagaagg aggacctagg cacacacata tggcggccac acccaggagg gttgattggc 240 

aggctggaag acaaaagtct cccaataaag gcacttttac ctcaaagang gggtgggagt 300 

tggtctgctg ggaatgttgc tgttggggtg gggaagantt atttc 345 

<210> 34 

<211> 440 

<212> DNA 

<213> Homo sapiens 

<400> 34 

tgtaattttt ttattggaaa acaaatatac aacttggaat ggattttgag gcaaattgtg 60 

ccataagcag attttaagtg gctaaacaaa gtttaaaaag caagtaacaa taaaagaaaa 120 

tgtttctggt acaggaccag cagtacaaaa aaatagtgta cgagtacctg gataatacac 180 

ccgttttgca atagtgcaac ttttaagtac atattgttga ctgtccatag tccacgcaga 240 

gttacaactc cacacctcaa caacaacatg ctgacagttc ctaaagaaaa ctactttaaa 300 

aaaggcataa cccagatgtt ccctcatttg accaactcca tctnagttta gatgtgcaga 360 

agggcttana ttttcccaga gtaagccnca tgcaacatgt tacttgatca attttctaaa 420 

ataaggcttt aggacaatga 440 

<210> 35 

<211> 540 

<212> DNA 

<213> Homo sapiens 

<400> 35 

atagatggaa tttattaagc ttttcacatg tgatagcaca tagttttaat tgcatccaaa 60 

gtactaacaa aaactctagc aatcaagaat ggcagcatgt tatttcataa caatcaacac 120 

ctgtggcttt taaaatttgg ttttcataag ataatttata ctgaagtaaa tctagccatg 180 

cttctaaaaa atgctttagg tcactccaag cttggcagtt aacatttggc ataaacaata 240

ataaaacaat cacaatttaa taaataacaa atacaacatt gtaggccata atcatataca 300

gtataaggga aaaggtggta gtgttganta agcagttatt agaatagaat accttggcct 360
ctatgcaaat atgtccagac actttgattc actcagccct gacattcagt tttcaaagcc 420
aggaaacagg ttctacagca tcattttaca gtttccaaca cattgaaaac aagtagaaaa 480
tgatganttg atttttatta atgcattaca tcctcaagan ttatcaccaa cccctcaggt 540 



<210> 36 
<211> 555 
<212> DNA 
<213> Homo sapiens

<400> 36

cttcgtgtgc ttgaaaattg gagcctgccc ctcggcccat aagcccttgt tgggaactga 60
gaagtgtata tggggcccaa nctactggtg ccagaacaca gagacagcag cccaiitgcaa 120
tgctgtcgag cactgcaaac gccatgtgtg gaactaggag gaggaatatt ccatcttggc 180
agaaaccaca gcattggttt ttttctactt gtgtgtctgg gggaatgaac gcacagatct 240
gtttgacttt gctataaaaa tagggctccc ccacctcccc cntttctgtg tnctttattg 300
tagcantgct gtctgcaagg gagcccctan cccctggcag acananctgc ttcagtgccc 360
ctttcctctc tgctaaatgg atgttgatgc actggaggtc ttttancctg cccttgcatg 420
gcncctgctg gaggaagana aaactctgct ggcatgaccc acagtttctt gactggangc 480
cntcaaccct cttggttgaa gccttgttcc gaccctgaca tntgcttggg cnctgggtng 540
gnctgggctt ctnaa 555

<210> 37
<211> 280
<212> DNA
<213> Homo sapiens



<400> 37

ccaccgacta taagaactat gccctcgtgc attcctgtac ctgcatcatc caactttttc 60
acgtggattt tgcttggatc ttggcaagaa accctaatct ccctccagaa acagtggacc 120
ctctaaaaaa taccctgact tctaataaca ttgatntcaa gaaaatgacg gtcacagacc 180
aggtgaactg ccccnagctc tcgtaaccag gttctacagg gaggctgcac ccactccatg 240
ttncttctgc ttcgctttcc cctaccccac cccccgccat 280

<210> 38
<211> 303
<212> DNA
<213> Homo sapiens



<400> 38 

catcgagctg gttgtcttct tgcctgccct gtgtcgtaaa atgggggtcc cttactgcat 60
tatcaaggga aaggcaagac tgggacgcct agcccacagg aagacctgca ccactgtcgc 120
cttcacacag gtgaactcgg aagacaaagg cgctttggct nagctggtgn aagctatcag 180
gaccaattac aatgacngat acgatnagat ccgccntcac tggggtagca atgtcctggg 240
tcctaagtct gtggctcgta tcgccnagcc cgaanaggcn aangctaaag aacttgccac 300
taa 303



<210> 39 

<211> 300 

<212> DNA
<213> Homo sapiens

<400> 39

gactcagcgg ctggtgctct tcctgtgcac aagcccagca ctccaggtcc caaggcattt 60
atcaaatccc accaagatnt ttggctttcg caccgaattc tgggtttggc tccctnaaag 120
aactcattga tgcaaatnac tnaaagtgag gtctgggtac ccttcacatg attccccaga 180
cctcanatgg gctaacacgc ttctcttctc cagcagtctt cctntccgtg aagttacctt 240
ccagattgtt acatggaact gaanacaaag ggagcctcag ctngatttaa atctggagca 300 



<210> 40
<211> 318
<212> DNA
<213> Homo sapiens

<400> 40

cccaacacaa tggctgagga caaatcagtc ctctgtgacc agacatgaga aggttgccaa 60
tgggctgttg ggcgaccaag gccttcccgg agtcctcgcc ctctatgagc tctcgcccat 120
gatggtgaag ctgacggaga agcacaggtc cttcacccac ttcctgacag gtgtgtgcgc 180
catcattggg ggcatgttca eagtggctgg actcaccgat tcgctcatct accacccagc 240
acgagccatc cagaaaaaaa ttgatctngg gaagacnacg tagtcaccct cggtncttcc 300
tctgtctcct ctttctcc 318



<210> 41
<211> 302
<212> DNA
<213> Homo sapiens

<400> 41

acttagatgg ggtccgttca ggggatacca gcgttcacat ttttcctttt aagaaagggt 60
cttggcctga atgttcccca tccggacaca ggctgcatgt ctctgtnagt gtcaaagctg 120
ccatnaccat ctcggtaacc tactcttact ccacaatgtc tatnttcact gcagggctct 180
ataatnagtc cataatgtaa atgcctggcc caagacntat ggcctgagtc tatccnaggc 240
ccaaacnatt accagacatt cctcttanat tgaaaacgga tntctttccc ttggcaaaga 300
tc 302



<210> 42 
<211> 299 
<212> DNA 
<213> Homo sapiens

<400> 42

cttaataagt ttaaggccaa ggcccgttcc attcttctag caactgacgt tgccagccga 60
ggtttggaca tacctcatgt aaatgtggtt gtcaactttg acattcctac ccattccaag 120
gattacatcc atcgagtagg tcgaacagct agagccgggc gctccggaaa ggctattact 180
tttgtcacac agtatgatgt ggaactcttc cagcgcatag aacacttnat tgggaagaaa 240
ctaccaggtt ttccaacaca ggacgatgag gttatgatgc cnacggaacg cgtcgctna 299

<210> 43
<211> 305
<212> DNA
<213> Homo sapiens



<400> 43 

ccaacaatgt caagacagcc gtctgtgaca tcccacctcg cggcctcaan atggcagtca 60
ccttcattgg caatagcaca gccntccggg agctccccaa gcgcatctcg gagcagttca 120
ccgccatgtt ccgccggaag gccttcctcc actggtacac aggcgagggc atggacaaga 180
tggagttcac cgaggctgag agcaacatga acgacctcgc ctctnagtat cagcagtacc 240
gggatgccac cgcagaaana ggaggaggat ttcggtnagg aggccgaaga aggaggcctg 300
305

<210> 44
<211> 399
<212> DNA 

<213> Homo sapiens 

<400> 44 

tttctgtggg ggaaacctga tctcgacnaa attagagaat tttgtcagcg gtatttcggc 60
tggaacagaa cgaaaacnga tnaatctctg tttcctgtat taaagcaact cgacncccag 120
cagacacagc tccnaattga ttccttcttt ngatcagcac aacagggaga aagaanatgc 180
ttaacgtatt aagagccnga gactaaacag agctttgaca tgcatgctta ggaaagagaa 240
agaagcagcn gcccgcgnaa ttngaagcng tttctgttgc cntgganaaa gaatttgagc 300
ttctttatta ggccaacgaa aaaccccgaa ananaggcnt tacnatacct tngaeiaancc 360
tccngccnna aaaagaaaga agctttcnga tcctcaacc 399

<210> 45
<211> 440
<212> DNA
<213> Homo sapiens



<400> 45 

gcgggagcag aagctaaagc caaagcccaa gagagtggca gtgccagcac tggtgccagt 60
accagtacca ataacagtgc cagtgccagt gccagcacca gtggtggctt cagtgctggt 120
gccagcctga ccgccactct cacatttggg ctcttcgctg gccttggtgg agctggcgcc 180
agcaccagtg gcagctctgg tgcctgtggc ttctcctaca agtgagattt taggtatctg 240
ccttggtttc agtggggaca tctggggctt anggggcngg gataaggagc tggatgattc 300
taggaaggcc cangttggag aangatgtgn anagtgtgcc aagacactgc ttttggcatt 360
ttattccttt ctgtttgctg gangtcaatt gacccttnna ntttctctca cttgtgtttt 420
canatatngt taatcctgcc 440



<210> 46
<211> 472
<212> DNA
<213> Homo sapiens

<400> 46

gctctgtaat ttcacatttt aaaccttccc ttgacctcac attcctcttc ggccacctct 60
gtttctctgt tcctcttcac agoaaaaact gttcaaaaga gttgttgatt accttcattt 120
ccactttctc acccccactc tccccccaat taactctcct tcatccccat gatgccatta 180
tgtggctntt attanagtca ccaaccttat tctccaaaac anaagcaaca aggactttga 240
cttctcagca gcactcagct ctggtncttg aaacaccccc gtcacttgct attcctccta 300
cctcataaca atctccttcc cagcctccac tgctgccttc tctgagttct tcccagggtc 360
ctaggctcag atgtagtgta gcccaaccct gctacacaaa gnaatctcct gaaagcctgt 420
aaaaatgtcc atncntgtcc tgtgagtgat ctnccangna naataacaaa tt 472



<210> 47 

<211> 550 

<212> DNA 

<213> Homo sapiens

<400> 47

ccttcctccg cctggccatc cccagcatgc tcatgctgtg catggagtgg tgggcctatg 60
aggtcgggag cttcctcagt ggtctgtatg aggatggatg acggggactg gtgggaacct 120
g9g99ccctg tctgggtgca aggcgacagc tgtctttctt caccaggcat cctcggcatg 180
gtggagctgg gcgctcagtc catcgtgtat gaaccggcca tcattgtgta catggtccct 240
gcaggcttca gtgtggctgc cagtgtccgg gtangaaacg ctctgggcgc tggagacatg 300
gaagcaggca cggaagtcct ctaccgttcc cctgctgatt acagtgctct ttgctgtanc 360
cttcagtgtc ctgctgttaa gctgtaagga tcacntgggg tacattttta ctaccgaccg 420
agaacatcac taacctggtg gctcaggtgg ttccaattta tgctgtttcc cacctctttg 480
aagctcttgc tgctcaggca cacgccaatt ttgaaaagta aacaacgtgc ctcggagtgg 540
gaattccgct 550



<210> 48 

<211> 214 

<212> DNA 

<213> Homo sapiens 



<400> 48 

agaaggacat aaacaagctg aacctgccca agacgtgtga tatcagcttc tcagatccag 60
acaacctcct caacttcaag ctggccatct gtcctgatna gggcttctac nagagtggga 120
agtttgtgtt cagttttaag gtgggccagg gttacccgca tgatcccccc aaggtgaagt 180
gtgagacnat ggtctatcac cccnacattg acct 214

<210> 49
<211> 267
<212> DNA
<213> Homo sapiens



<400> 49 

atctgcccaa aatttattca aataacgaaa acnaatctgt tttaagaaat tcagtctttt 60
agtttttagg acaactatgc acaaatgtac gatiggagaat tctttttgga tnaactctag 120
gtngaggaac ttaatccaac cggagctntt gtgaaggtca gaansicagga gagggaatct 180
tggcaaggaa tggagacnga gtttgcaaat tgcagctaga grnaatngtt ncaaatggga 240
ctgctntcgt gtctcccang gaaagct 267



<210> 50 
<211> 300 
<212> DNA
<213> Homo sapiens

<400> 50

gactgggtca aagctgcatg aaaccaggcc ctggcagcaa cctgggaatg gctggaggtg 60
ggagagaacq tgacttctct ttccctctcc ctcctccaac attactggaa ctctgtcctg 120
ttgggktctt ctgagcttgt ttccctgctg ggtgggacag aggacaaagg agaagggagg 180
gtctagaaga ggcagccctt ctttgtcctc tggggthaat gagcctgacc tanagtagac 240
ggagagacca anagcctctg attcttaatt tccataanat gttcnaagta tatntntacc 300

<210> 51
<211> 300
<212> DNA
<213> Homo sapiens



<400> 51 

gggtaaaatc ctgcagcacc cactctggaa aatactgctc ttaattttcc tgaaggtggc 60
cccctatttc tagttggtcc aggattaggg atgtggggta tagggcattt aaatcctctc 120
aagcgctctc caagcacccc cggcctgggg gtnagcttct catcccgcta ctgctgctgg 180
gatcaiggttn aataaatgga actcttcctg tctggcctcc aaagcagcct aaaaactgag 240
gggctctgtt agaggggacc tccaccctrin ggaagtccga ggggctnggg aagggtttct 300



<210> 52 
<211> 267 
<212> DNA 
<213> Homo sapiens

<400> 52

aaaatcaact tcncgcatta acanacanat tccanancag gaagtgaana taattctctg 60
cacctatcaa ggaacnnact tgattgcctc tactnaacan atatatcgag ttnctatact 120
tacctgaaca ccnccgcata actctcaacc nanatnentc nccatgacac tcntccttna 180
atgctantcc cgaattcttc attatatcng tgatgttcgn cctgntnata tatcagcaag 240
gtatgtnccn taactgccga nncaang 267

<210> 53
<211> 401
<212> DNA
<213> Homo sapiens



<400> 53 

agsctttagc atcatgtaga agcaaactgc acctatggct gagataggtg caatgaccta 60
caagattttg tgttttctag ctgtccagga aaagccatct tcagtcttgc tgacagtcaa 120
agagcaagtg aaaccatttc cagcctaaac tacataaaag cagccgaacc aatgattaaa 180
gacctctiaag gctccataat eatcattaaa tatgcccaaa ctcattgtga ctttttattt 240
tatatacagg atcaaaatca acattaaatc atcttattta catggccatc ggtgctgaaa 300
ttgagcattt taaatagtac agtaggctgg tatacattag gaaacggact gcactggagg 360
caaatagaaa actaaagaaa ttagataggc tggaaatgct t 401

<210> 54
<211> 401
<212> DNA
<213> Homo sapiens



<400> 54 

cccaacacaa tggataaaaa cacttatagt aaatggggac attcactata atgatctaag 60
aagctacaga ttgtcatagt tgttttcctg ctttacaaaa ttgctccaga tctggaatgc 120
cagtttgacc tttgtcttct ataatatttc cttttcctcc ccccttcgaa tctctgtata 180
tttgattctt aactaaaatt gttctcttaa atattctgaa tcctggtaat taaaagtttg 240
ggtgtattct ctttacctcc aaggaaagaa ctactagcta caaaaaatat tctggaataa 300
gcattgtttt ggtataaggt acatattttg gttgaagaca ccagaccgaa gtaaacagct 360
gtgcatccaa tttattatag ttttgtaagt aacaacatgt a 401

<210> 55
<211> 933
<212> DNA
<213> Homo sapiens



<400> 55 

tttactgctt ggcaaagtac cctgagcatc agcagagatg ccgagatgaa atcagggaac 60
tcctagggga tgggtcttct attacctggg aacacctgag ccagatgcct tacaccacga 120
tgtgcatcaa ggaatgcctc cgcctctacg caccggcagt aaactatccc ggttactcga 180
caaacccatc acctttccag atggacgctc cctacctgca ggaataactg tgtttatcaa 240
tatttgggct cttcaccaca acccctattt ctgggaagac cctcaggtct ttaacccctt 300
gagattctcc agggaaaatt ctgaaaaaat acatccctat gccttcatac cattctcagc 360
tggattaagg aactgcattg ggcagcattt tgccataatt gagtgtaaag cggcagtggc 420
attaactctg ctccgcttca agctggcccc agaccactca aggccaccca gctgtcgtca 480
agttgcctca agcccaagaa cggaatccat gtgtttgcaa aaaaagtttg ctaattttaa 540
gtccttttcg tacaagaatc aakgagacaa ttttcctacc aaaggaagaa caaaaggaca 600
aatataatac aaaatatatg tatatggctg tttgacaaat tatataactt aggatacttc 660
tgactggttt tgacatccat taacagcaat tttaatttcc tcgctgtatc tggtgaaacc 720
cacaaaaaca cctgaaaaaa ctcaagctga gttccaatgc gaagggaaat gattggtttg 780
ggtaactagt ggtagagtgg ctttcaagca tagtttgatc aaaactccac tcagtatctg 840
cattactttt atctctgcaa atatctgcat gatagcttta ttctcagtta tctttcccca 900
taataaaaaa tatctgccaa aaaaaaaaaa aaa 933



<210> 56
<211> 480
<212> DNA
<213> Homo sapiens

<400> 56

ggctttgaag catttttgtc tgtgctccct gatcttcagg tcaccaccat gaagttctta 60
gcagtcctgg tactcttggg agtttccatc ttcctggtct ctgcccagaa tccgacaaca 120
gctgctccag ctgacacgta tccagctact ggtcctgctg atgatgaagc ccctgatgct 180
gaaaccactg ctgctgcaac cactgcgacc actgctgctc ctaccactgc aaccaccgct 240
gcttctacca ctgctcgtaa agacattcca gttttaccca aatgggttgg ggatctcccg 300
aatggtagag tgtgtccctg agatggaatc agcttgagcc ttctgcaact ggtcacaact 360
attcatgctt cctgtgattt catccaacta cttaccttgc ctacgatatc ccctttatct 420
ctaatcagtt tattttcttt caaataaaaa ataactatga gcaacaaaaa aaaaaaaaaa 480

<210> 57
<211> 798
<212> DNA
<213> Homo sapiens



<400> 57 

agcctacctg gaaagccaac cagtcctcat aatggacaag atccaccagc tcctcctgtg 60
gactaacttt gtgatatggg aagcgaaaac agttaacacc ttgcacgacc aaacgaacga 120
agatgaccag agtactctta accccttaga actgtttttc cttttgtatc tgcaatatgg 180
gatggtattg ttttcatgag cttctagaaa tttcacttgc aagtttattt ttgcttcctg 240
tgttactgcc attcctattt acagtatatt tgagtgaatg attatatttt taaaaagtta 300
catggggctt ttttggttgt cctaaactta caaacattcc actcattctg tttgtaactg 360
tgatcataat ttttgcgata atttctggcc tgattgaagg aaatttgaga ggtctgcatt 420
tatatatttt aaatagattt gataggtttt taaattgctt tttttcataa ggtatccata 480
aagttatttg gggttgtctg ggattgtgtg aaagaaaatt agaaccccgc tgtatttaca 540
ttcaccttgg cagttcattc gtggatggca gttttctgta gttttgggga ctgtggcagc 600
tcttggattg ttttgcaaat tacagctgaa atctgtgtca tggattaaac tggcttatgt 660
ggctagaata ggaagagaga aaaaatgaaa tggttgttta ctaattttat actcccatta 720
aaaattttta atgttaagda aaccttaaat aaacatgatt gatcaatatg gaaaaaaaaa 780
aaaaaaaaaa aaaaaaaa 798

<210> 58
<211> 280
<212> DNA
<213> Homo sapiens



<400> 58 

ggggcagctc ctgaccctcc acagccacct ggtcagccac cagctggggc aacgagggtg 60
gaggtcccac tgagcctctc gcctgccccc gccactcgtc tggtgcttgt tgatccaagt 120
cccctgcctg gtcccccaca aggactccca tccaggcccc ctctgccctg ccccttgtca 180
tggaccacgg tcgtgaggaa gggctcatgc cccttattta tgggaaccat ttcattctaa 240
cagaataaac cgagaaggaa accagaaaaa aaaaaaaaaa 280



<210> 59 
<211> 382 
<212> DNA 

<213> Homo sapiens 
<400> 59 

aggcgggagc agaagctaaa gccaaagccc aagagagcgg cagtgccagc actggcgcca 60 
gtaccagtac caataacagt gccagcgcca gtgccagcac cagtggtggc ttcagtgctg 120 
gtgccagcct gaccgccact ctcacatttg ggctcttcgc tggccttggt ggagctggtg 180 
ccagcaccag tggcagctct ggtgcctgtg gtttctccta caagtgagat tttagatact 240 
gttaatcctg ccagtctttc tcttcaagcc agggtgcatc ctcagaaacc cacccaacac 300 
agcactccag gcagccacta tcaatcaatt gaagttgaca ctctgcatta aatctatttg 360 
ccattaaaaa aaaaaaaaaa aa 382 

<210> 60 

<211> 602 
<212> DNA 
<213> Homo sapiens 

<400> 60 

tgaagagccg cgcggtggag ctgctgcccg acgggactgc caaccttgcc aagctgcagc 60 
ttgtggtgga gaacagtgcc cagcgggtca tccacttggc gggtcagtgg gagaagcacc 120 
gggtcccatc ctcgtgagta ccgccactcc gaaagctgca ggattgcaga gagctggaat 180 
cttctcgacg gctggcagag acccaagaac tgcaccagag tgtccgggcg gccgctgaag 240 
aggcccgcag gaaggaggag gtctataagc agctgatgtc agagctggag actctgccca 300 
gagatgtgtc ccggctggcc tacacccagc gcatcctgga gatcgtgggc aacatccgga 360 
agcagaagga agagatcacc aagatcttgt ctgatacgaa ggagcttcag aaggaaatca 420 
actccctatc tgggaagctg gaccggacgt ttgcggtgac tgatgagctt gtgttcaagg 480 
atgccaagaa ggacgatgct gttcggaagg cctacaagta tccagctgct ctgcacgaga 540. 
actgcagcca gctcatccag accatcgagg acacaggcac catcatgcgg gaggttcgag 600 
ac 602 

<210> 61 

<211> 1368

<212> DNA 

<213> Homo sapiens 

<400> 61

ccagtgagcg cgcgtaatac gactcactat agggcgaatt gggtaccggg ccccccctcg 60 
agcggccgcc cttttttttt tttttttatt gatcagaatt caggctttat tattgagcaa 120 
tgaaaacagc taaaacttaa ttccaagcat gtgtagttaa agtttgcaaa gtgggatatt 180
gttcacaaaa cacattcaac gtttaaacac tatccacttg aagaacaaaa tatatttaaa 240 
attgtttg.Gt tctaaaaagc ccatttccct ccaagtctaa actttgtaat ttgatattaa 300 
gcaatgaagt tattttgtac aatctagttja aacaagcaga atagcactag gcagaataaa 360 
aaattgcaca gacgtatgca attttccaag atagcattct ttaaattcag ttttcagctt 420 
ccaaagattg gtcgcccata atagacctaa acatataatg atggctaaaa aaaaCaagta 480 
tacgaaaatg taaaaaagga aatgtaagtc cactctcaat ctcataaaag gtgagagtaa 540
ggatgctaaa gcaaaataaa tgtaggttct tctcttccgt ttccgtctat cacgcaatct 600 
gcttctttga tatgccttag ggttacccat ttaagttaga ggttgtaatg caatggtggg 660 
aatgaaaatt gatcaaatat acaccttgtc atttcatttc aaattgcggg ctggaaactt 720
ccaaaaaaag ggtaggcatg aagaaaaaaa aaatcmaatc agaacctctt caggggtttg 780
kgktctgata tggcagacar gatacaagtc ccaccaggag atggagcaat tcaaaataag 840
ggtaatgggc tgacaaggta ttattgccag catgggacag aatgagcaac aggctgaaaa 900 
gtttttggat tatatagcac ctagagcctc tgatgtaggg aatttttgtt agtcaaacat 960 
acgctaaact tccaagggaa aatctttcag gcagcctaag cttgcttctc cagagtgatg 1020 
agttgcattg ctactgtgat ttcttgaaaa caaactgggt ttgtacaagt gagaaagact 1080 
agagagaaag attttagtct gtttagcaga agccatttta tctgcgtgca catggatcaa 1140 
tatttctgat cccctatacc ccaggaaggg caaaatccca aagaaatgcg ttagcaaaat 1200

tggctgatgc taccatattg ctatggacat tgatcttgcc caacacaatg gaattccacc 1260 

acactggact agtggatcca ctagttctag agcggccggc caccgcggtg gagctccagc 1320 

ttttgttccc tttagtgagg gttaattgcg cgcttggcgt aatcatnn 1368 

<210> 62

<211> 924
<212> DNA

<213> Homo sapiens 

<400> 62 

caaaggnaca ggaacagctt gnaaagtact gncatncctn cctgcaggga ccagcccttt 60
gcctccaaaa gcaataggaa atttaaaaga tttncactga gaaggggncc acgtttnart 120
tntnaatgtn tcargnanar tnccttncaa atgncrnctn cactnactnr gnatttgggt 180
tnccgnrtnc mgnactatnt caggtttgaa aaactggatc tgccacttat cagttatgtg 240
accttaaaga actccgttaa ttcctcagag cctcagtttc cttgtctata agctgggagt 300 
aatattaata ctatcatttt tccaaggatt gatgtgaaca ttaatgaggt gaaatgacag 360 
atgtgtatca tggttcctaa taaacatcca aaatatagta cttactattg tcattattat 420 
tactcgcttg aagctaaaga cctcacaata gaatcccacc cagcccacca gacagagytc 480 
tgagttttct agtttggaag agctattaaa taacaacktc tagtgtcaat tctatacttg 540
ttatggtcaa gtaactgggc tcagcatttt acattcattg tctctttaag tcctagcaat 600 
gtgaagcagg aactatgatt atattgacta cataaacgaa gaaattgagg ctcagataca 660 
ttaagtaatt ctcccagggt cacacagcta gaactggcaa agcctgggac tgatccatga 720
tcttccagca ttgaagaatc ataaatgcaa ataactgcaa ggccttttcc tcagaagagc 780
tcctggtgbt tgcaccaacc cactagcact tgtitctctac aggggaacat ctgtgggcct 840
gggaatcact gcacgtcgca agagatgttg cttctgaitga atcattgttc ctgccagcgg 900
tgtgaaggca aaaaaaaiaaa aaaa 924

<210> 63 

<211> 1079 

<212> DNA 

<213> Homo sapiens 



<400> 63 

agtcccaaga actcaataat ctcttatgtt ttcttttgaa gacttatttt aaatattaac 60
tatttcggtg cctgaatgga aaaatataaa cattagctca gagacaatgg ggtacctgtt 120
tggaatccag ctggcagcta taagcaccgt tgaaaactct gacaggcttt gtgccictttt 180
cactaaatgg cctcacatcc tgaatgcagg aatgtgttcg tttaaataaa cattaatctt 240
taatgttgaa ttccgaaaac acaaccataa atcatagttg gtttttctgc gaeaacgacc 300
tagtacatta tttcctccac agcaaaccta cctttccaga aggtggaaat tgtatttgca 360
acaatcaggg caaaacccac acttgaaaag cattttacaa tattatatct aagttgcaca 420
gaagacccca gtgatcacta ggaaatetac cacagtccag tttttctaat ccaagaaggt 480
caaacttcg^ ggaataatgt gtccctcttc tgctgctgct ctgaaaaata ttcgatcaaa 540
acgaaigttta caagcagcag ttattccaag attagagttc atttgtgtat cccatgtata 600
ctggcaatgt ttaggtttgc ccaaaaactc ccagacatcc acaatgttgt tgggtaaacc 660
accacatctg gtaacctctc gatcccttag atttgtatct cctgcaaata taactgtagc 720
tgactctgga gcctcttgca ttttctttaa aaccattttc aactgattca ttcgttccgc 780
agcatgccct ctggtgctct ccaaatggga tgtcataagg caaagctcat ttcctgacac 840
attcacatgc acacacaaaa ggcttctcat cattttggta cttggeLaaag gaataatctc 900
ttggcttttt aatttcactc ttgatttctt caacattata gctgtgaaat atccttcttc 960
atgacbtgta ataatctcat aattacttga tctcttcttt aggcagctat aatatggggg 1020
aataacttec tgtagaaata tcacatctgg gctgtacaaa gctaagtagg aacacaccc 1079



<210> 64 
<211> 1001
<212> DNA

<213> Homo sapiens 

<400> 64 

gaatgtgcaa cgatcaagtc agggtatctg tggtatccac cactttgagc atttatcgat 60 

tctatatgtc aggaacattt caagttatct gttctagcaa ggaaatacaa aacacctata 120 

gttaactatg gcctatctac agtgcaacta aaaactagat tttattcctt tccacccgtg 180 

ggtttgtatt catttaccac cctcttttca ttccctttct cacccacaca ctgtgccggg 240 

cctcaggcat atactattct actgtctgtc tctgtaagga ttatcatttt agcttccaca 300 

tatgagagaa tgcatgcaaa gtttttcttt ccatgtctgg ctcatctcac ttaacataat 360 

gacctccgct tccatccatg ttatttatat tacccaatag tgttcataaa tatacataca 420 

cacatatata ccacattgca tttgtccaat tattcattga cggaaactgg ttaatgttat 480 

atcgttgcta ttgtggatag tgctgcaaca aacacgcaag tggggatata atttgaagag 540

tttttttgtt gacgttcccc caaattttaa gattgttttg tctacgtttg tgaaaatggc 600 

gttagtattt tcatagagat tgcattgaat ctgtagattg ctttgggtaa gtatggctat 660 

tttgatggta tfeaatttttt cattccatga agatgagatg tctttccat): gtttgtgtcc 720 

tctacatttt ctttcatcaa agttttgttg tatttttgaa gtagatgtat ttcaccttac 780

agaccaagtg tattccctaa atattttatt ttcgtagcta ttgtagatga aattgccttc 840 

ttgatttctt tttcacttaa ttcattatta gtgtatggaa atgttatgga tttttatttg 900

ttggttttta atcaaaaact gtatraaact tagagctttt tgtggagttt ttaagttttt 960
ctagatataa gatcatgaca tctaccaaaa aaaaaaaaaa a 1001

<210> 65 

<211> 575 

<212> DNA 

<213> Homo sapiens 

<400> 65 

acttgatata aaaaggatat ccataatgaa tattttatac tgcatccttt acattagcca 60 
ctaaatacgt tattgcttga cgaagacctt tcacagaatc qtatggattg cagcatttca 120 
cttggctact tcatacccat gccttaaaga ggggcagttt ctcaaaagca gaaacatgcc 180 
gccagttctc aagttttcct cctaactcca tttgaatgta agggcagctg gcccccaatg 240 
tggggaggtc cgaacatttt ctgaattccc attttcttgt tcgcggctaa atgacagttt 300 
ctgtcattac ttagattccc gatctttccc aaaggtgttg atttacaaag aggccagcta 360 
atagccagaa atcatgaccc tgaaagagag atgaaatttc aagctgtgag ccaggcagga 420 
gctccagtat ggcaaaggct cttgagaatc agccacttgg tacaaaaaag atttttaaag 480 
cttttatgtt ataccatgga gccatagaaa ggctatggat tgtttaagaa ctattttaaa 540 
gtgttccaga cccaaaaagg aaaaaaaaaa aaaaa 575 

<210> 66 

<211> 831 

<212> DNA 

<213> Homo sapiens 

<400> 66

atcgggctcc ttctgctaaa cagccacatt gaaatggttt aaaagcaagt cagatcaggt 60 
gatttgtaaa attgtattta tctgtacatg tatgggcttt taattcccac caagaaagag 120 
agaaattatc tttttagtta aaaccaaatt tcacttttca aaatatcttc caacttattt 180 
attggttgtc actcaattgc ctatatatac atacatatat gtgtgtgtgt gtgtgtgcgc 240 
gtgagcgcac gtgtgtgtat gcgtgcgcat gtgtgtgtat gtgtattatc agacataggt 300 
ttctaacttt tagatagaag aggagcaaca tctatgccaa atactgtgca ttctacaatg 360 
gtgctaatct cagacctaaa tgatactcca tttaatttaa aaaagagttt taaataatta 420 
tctatgtgcc tgtatccccc ttttgagtgc tgcacaacat gttaacatat tagtgtaaaa 480 
gcagatgaaa caaccacgtg ttctaaagtc tagggattgt gctataatcc ctatttagtt 540 
caaaattaac cagaattctt ccatgtgaaa tggaccaaac tcacattatt gttatgtaaa 600 
tacagagttt taatgcagta tgacatccca caggggaaaa gaatgtctgc agcgggcgac 660
tgttatcaaa tattttatag aatacaatga acggtgaaca gactggtaac ttgtttgagt 720
tcccatgaca gattcgagac ttgtcaatag caaatcattt ttgtatttaa atttttgtac 780
tgatttgaaa aacatcatta aatatcttta aaagtaaaaa aaaaaaaaaa a 831

<210> 67
<211> 590
<212> DNA

<213> Homo sapiens



<400> 67 

gtgctctgtg tattttttta ctgcattaga cattgaatag taatttgcgt taagatacgc 60
ttaaaggctc tttgtgacca tgtttccctt tgtagcaata aaatgttttt tacgaaaact 120
ttctccctgg attagcagtt taaatgaaac agagttcatc aatgaaatga gtatttaaaa 180
taaaaatttg ccttaatgta tcagttcagc tcacaagtat tttaagatga ttgagaagac 240
ttgaattaaa gaaaaaaaaa ttctcaatca tatttttaaa atataagact aaaattgttt 300
ttaaaacaca tttcaaatag aagtgagttt gaactgacct tatttatact ctttttaagt 360
ttgttccttt tccctgtgcc tgtgtcaaat cttcaagtct tgctgaaaat acatttgata 420
caaagtttcc tgtagttgtg ttagttcttt tgtcatgtct gtttttggct gaagaaccaa 480
gaagcagact tttcttttaa aagaattatt tctctttcaa atatttctat cctttttaaa 540
aaattccttt ttatggctta tatacctaca tatttaaaaa aaaaaaaaaa 590



<210> 68
<211> 301
<212> DNA 
<213> Homo sapiens 

<400> 68 



ttgtgttggg gttccctttt ccggtcggcg tggccttgcg agtggagtgt ccgctgtgcc 60
cgggcctgca ccatgagcgt cccggccttc accgacatca gtgaagaaga tcaggctgct 120
gagcttcgtg cttatctgaa atctaaagga gctgagattt cagaagagaa ctcggaaggt 180
ggacttcatg ttgatttagc tcaaaccatt gaagcctgtg atgtgcgtct gaaggaggat 240
gataaagatg ttgaaagtgt gatgaacagt ggggnatcct actcttgatc cggaanccna 300
c 301



<210> 69 
<211> 301 
<212> DNA 

<213> Homo sapiens

<400> 69

tctatgagca tgccaaggct ctgtgggagg atgaaggagt gcgtgcctgc tacgaacgct 60 

ccaacgagta ccagctgatt gactgtgccc agtacttcct ggacaagatc gacgtgatca 120 

agcaggctga ctatgtgccg agcgatcagg acctgcttcg ctgccgtgtc ctgacttctg 180 

gaatctttga gaccaagttc caggtggacn aagtcaactt ccacatgntt gacgtgggtg 240 

gccagcgcga tgaacgccgc aagtggatcc agtgcttcaa cgatgtgact gccatcatct 300 

t 301 



<210> 70 

<211> 201 

<212> DNA 

<213> Homo sapiens 



<400> 70 

gcggctcttc ctcgggcagc ggaagcggcg cggcggtcgg agaagtggcc taaaacttcg 60
gcgttgggtg aaagaaaatg gcccgaacca agcagactgc tcgtaagtcc accggtggga 120
aagccccccg caaacagctg gccacgaaag ccgccaggaa aagcgctccc tctaccggcg 180 
gggtgaagaa gcctcatcgc t 201 



<210> 71 

<211> 301 

<212> DNA 

<213> Homo sapiens 

<400> 71

gccggggtag tcgccgncgc cgccgccgct gcagccactg caggcaccgc tgccgccgcc 60 

tgagtagtgg gcttaggaag gaagaggtca tctcgctcgg agcttcgctc ggaagggtct 120 

ttgttccctg cagccctccc acgggaatga caatggataa aagtgagctg gtacanaaag 180 

ccaaactcgc tgagcaggct gagcgacatg atgatatggc tgcagccatg aaggcagtca 240 

cagaacaggg gcacgaactc ttcaacgaag agagaaatct gctctctggt gcctacaaga 300 

a 301 

<210> 72 
<211> 251 
<212> DNA 
<213> Homo sapiens 

<400> 72 

cttggggggt gttgggggag agactgtggg cctggaaata aaacttgtct cctccaccac 60
caccctgtac cctagcctgc acctgtccac atctctgcaa agttcagctt ccttccccag 120
gtctctgtgc actctgtctt ggatgctctg gggagctcat gggtggagga gtctccacca 180
gagggaggct caggggactg gttgggccag ggatgaatat ttgagggata aaaattgtgt 240
aagagccaan g 251



<210> 73 

<211> 913 

<212> DNA

<213> Homo sapiens 



<400> 73 



tttttttttt tttttcccag gccctctttt tatttacagt gataccaaac catccacttg 60
caaattcttt ggtctcccat cagctggaat taagtaggta ctgtgtatct ttgagatcat 120
gtatttgtct ccactttggt ggatacaaga aaggaaggca cgaacagctg aaaaagaagg 180
gtatcacacc gctccagctg gaatccagca ggaacctctg agcatgccac agccgaacac 240
ttaaaagagg aaagaaggac agctgctctt catttatttt gaaagcaaat tcatttgaaa 300
gtgcataaat ggtcatcata agtcaaacgt atcaattaga ccttcaacct aggaaacaaa 360
attttttttt tctatttaat aatacaccac actgaaatca tttgccaatg aaccccaiaag 420
atttggtaca aatagtacaa tccgtatttg ctttcctctt tcctttcttc agacaaacac 480
caaataaaat gcaggtgaaa gagatgaacc acgactagag gctgacttag aaatttatgc 540
tgactcgatc taaaaaaaat tatgttggtt aatgtcaatc catctaaaac agagcatttt 600
gggaatgctt ttcaaagaag gtcaagtaac agtcatacag ctagaaaagt ccctgaaaaa 660
aagaattgtt aagaagtata ataacctttt caaaacccac aatgcagctt agttttcctt 720
tatttatttg tggtcatgaa gactatcccc atttctccat aaaatcctcc ctccatactg 780
ctgcattatg gcacaaaaga ctctaagtgc caccagacag aaggaccaga gtttctgatt 840
ataaacaatg atgctgggta atgtttaaat gagaacattg gatacggatg gccagcccaa 900
cacaatggaa ttc 913

<210> 74
<211> 351 
<212> DNA 

<213> Homo sapiens 
<400> 74 

tgtgcncagg ggatgggtgg gcngtggaga ngatgacaga aaggctggaa ggaanggggg 60 
tgggtttgaa ggccanggcc aaggggncct caggtccgnt tctgnnaagg gacagccttg 120
aggaaggagn catggcaagc catagctagg ccaccaatca gattaagaaa nnctgagaaa 180
nctagctgac catcaccgtt ggtgnccagt ttcccaacac aatggaatnc caccacactg 240
gactagngga hccactagtt ctagagcggc cgccaccgcg gtggaacccc aacttttgcc 300
cctttagnga gggttaatcg cgcgcttggc ntaatcatgg tcataagctg t 351

<210> 75

<211> 251

<212> DNA 

<213> Homo sapiens 

<400> 75 

tacttgacct tctttgaaaa gcattcccaa aatgctctat tttagataga ttaacattaa 60

ccaacataat tttccttaga ccgagtcagc ataaatttcc aagtcagcct ctagtcgcgg 120 

ttcatctcct tcacctgcat tttatttggt gtttgtctga agaaaggaaa gaggaaagca 180 

aatacgaatt gtactatttg taccaaatct ttgggattca ttggcaaata atttcagtgt 240 

ggtgtattat t 251 

<210> 76

<211> 251 

<212> DNA 

<213> Homo sapiens 

<400> 76 

tatttaataa tacaccacac tgaaattatt tgccaatgaa tcccaaagat ttggtacaaa 60 

tagtacaatt cgtatttgcc ttcctctttc ctttcttcag acaaacacca aataaaatgc 120

aggtgaaaga gatgaaccac gactagaggc tgacttagaa atttatgctg actcgaccta 180

aaaaaaatta tgtcggttaa tgttaatcta tctaaaatag agcattttgg gaatgctttt 240
caaagaaggt c 251

<210> 77 
<211> 351
<212> DNA 

<213> Homo sapiens 
<400> 77 

actcaccgtg ctgtgtgctg tgtgcctgct gcctggcagc ctggccctgc cgctgctcag 60 
gaggcgggag gcatgagtga gctacagcgg gaacaggctc aggactaccc caagagannn 120

tatctctatg actcagaaac aaaaaatgcc aacagtttag aagccaaact caaggagatg 180 
caaaaattct ttggcctacc tataactgga atgttaaact cccgcgtcat agaaataatg 240
cagaagccca gatgtggagt gccagatgtt gcagaatact cactatttcc aaatagccca 300 
aaatggactt ccaaagtggt cacctacagg atcgtatcat atactcgaga c 351 

<210> 78 

<211> 1592
<212> DNA
<213> Homo sapiens
<400> 78

gaattccatt gtgttggggc cctgggggcg gaggggaggg gcccaccacg gccttatttc 60 

cgcgagcgcc ggcactgccc gctccgagcc cgtgtctgtc gggtgccgag ccaactttcc 120 

tgcgtccatg cagccccgcc ggcaacggct gcccgctccc tggtccgggc ccaggggccc 180 

gcgccccacc gccccgctgc tcgcgctgct gctgttgctc gccccggtgg cggcgcccgc 240

ggggtccggg gaccccgacg accctgggca gcctcaggat gctggggtcc cgcgcaggct 300 

cctgcagcag gcggcgcgcg cggcgcttca cttctccaac ttccggcccg gctcgcccag 360 

cgcgctgcga gtgccggccg aggtgcagga gggccgcgcg tggattaatc caaaagaggg 420 

atgtaaagtt cacgtggtct tcagcacaga gcgctacaac ccagagtctt tacttcagga 480 

aggtgaggga cgtttgggga aatgttctgc tcgagtgttt ttcaagaatc agaaacccag 540 

accaactatc aatgcaactc gtacacggct catcgagaaa aagaaaagac aacaagagga 600 

ttacctgctt tacaagcaaa tgaagcaact gaaaaacccc ttggaaatag tcagcatacc 660 

tgataatcat ggacatattg atccctctct gagactcatc tgggatttgg ctttccttgg 720 

aagctcttac gtgatgtggg aaatgacaac acaggtgtca cactactact tggcacagct 780

cactagtgtg aggcagtgga aaactaatga tgatacaatt gattttgatt atactgttct 840 

acttcatgaa ttatcaacac aggaaataat tccctgtcgc attcacttgg tctggtaccc 900 

tggcaaacct cctaaagtga agtaccactg tcaagagcta cagacaccag aagaagcctc 960 

cggaactgaa gaaggatcag ctgtagtacc aacagagctt agtaatttct aiaaaagaaaa 1020

aatgatcttt ttccgacttc taaacaagtg actatactag cataaatcat tcttctagta 1080 

aaacagctaa ggtatagaca ttctaataat ttgggaaaac ctatgattac aagtaaaaac 1140 

tcagaaatgc aaagatgtcg gttttttgtt tctcagtctg ctttagcttt taactctgga 1200 

agcgcatgca cactgaactc tgctcagtgc caaacagtca ccagcaggtt cctcagggtt 1260

tcagccctaa aatgtaaaac ctggataatc agcgtatgtt gcaccagaat cagcattttt 1320 

tttttaactg caaaaaatga tggtctcatc tctgaattta tatttctcat tcttttgaac 1380 

atactacagc taatatattt tatgttgcta aattgcttct atctagcatg ttaaacaaag 1440 

ataatatact ttcgatgaaa gtaaattata ggaaaaaaat taactgtttt aaaaagaact 1500 

tgattatgtt ttatgatttc aggcaagtat tcatttttaa cttgctacct acttttaaat 1560 

aaatgtttac atttctaaaa aaaaaaaaaa aa 1592 

<210> 79 

<211> 401 

<212> DNA 

<213> Homo sapiens 

<400> 79

catactgtga attgttcttg actccttttc ttgacattca gttttcanaa tttccatctt 60 

tcttctggaa ctaatgtgct gttctcttga ctgcctgctg ggccagcatc cgattgccag 120 

ccagaaacgt cacactgccc aagatggcca ggtacttcaa ggtctggaac atgttgagct 180 

gagtccagta gacatacatg agtcccagca tagcagcatg tcccaggtga aatataatcg 240 

tgctaggagc aaaagtgaag ttggagacat cggcaccaat ccggatccac tagttctaga 300 

gcggccgcca ccgcggtgga gctccagctt ttgttccctt tagtgagggt taattgcgcg 360 

cttggcgtaa tcatggncat agctgtttcc tgtgtgaaat t 401 

<210> 80
<211> 301
<212> DNA
<213> Homo sapiens

<400> 80 

aaaaatgaaa catctatttt agcagcaaga ggctgtgagg gatggggtag aaaaggcatc 60

ctgagagagc tctagaccga cccaggtcct gtggcacact atacgggtca ggaggggtgg 120 

aagacaggcc taagctctag gacggtgaat ctcggggcta tttgtggatt tgttagaaac 180 

agacattctt ttggcctttt cctggcactg gcgttgccgg caggtgggca gaagtgagcc 240 

accagccact gttcagtcat tgccaccaca gatcttcagc agaatcttcc ggtaatcccc 300 

t 301 
<210> 81

<211> 301 

<212> DNA 

<213> Homo sapiens 



<400> 81 



tagccaggtt gcccaagcta attttattct ttcccaacag gatccatttg gaaaatatca 60
agcctttaga atgtggcagc aagagaaagc ggactacgca ggaacgggga gtttgggaga 120
agctctcctg gtgttgactt agggatgaag gctccaggct gctgccagaa atggagtcac 180
cagcagaaga actgntttct ctgataagga tgtcccacca ttttcaagct gttcgttaaa 240
gttacacagg tccttcttgc agcagtaagt accgttagcc cattttccct caagcgggtt 300
t 301



<210> 82

<211> 201 

<212> DNA 

<213> Homo sapiens

<400> 82 

tcaacagaca aaaaaagttt attgaataca aaactcaaag gcatcaacag tcctgggccc 60

aagagatcca tggcaggaag tcaagagttc tgcttcaggg tcggtctggg cagccctgga 120

agaagtcatt gcacatgaca gtgatgagtg ccaggaaaac agcatactcc tggaaagtcc 180

acctgctggn cactgrittca t 201 

<210> 83
<211> 251
<212> DNA
<213> Homo sapiens 

<400> 83 

gtaaggagca tactgtgccc attcattata gaatgcagtt aaaaaaaata ttttgaggtt 60
agcctcccca gtttaaaagc acttaacaag aaacacttgg acagcgatgc aatggtctct 120
cccaaaccgg ctccctctta ccaagtaccg taaacagggt ttgagaacgt tcaatcaatt 180
tcttgatatg aacaatcaaa gcatttaacg caaacatatt tgcttctcaa anaataaaac 240
cattttccaa a 251

<210> 84
<211> 301
<212> DNA
<213> Homo sapiens

<400> 84

agtttataat gttttactat gatttagggc ttttttttca aagaacaaaa attataagca 60

taaaaactca ggtatcagaa agactcaaaa ggctgttttt cactttgttc agattttgtt 120

tccaggcatt aagtgtgtca tacagttgtt gccactgctg ttctccaaat gcccgatgtg 180

tgctatgact gacaactact tttctctggg tctgatcaat tttgcagtari accattttag 240

ttcttacggc gtcnataaca aatgcttcaa catcatcagc tccaatctga agtcttgctg 300

c 301

<210> 85

<211> 201

<212> DNA

<213> Homo sapiens

<400> 85

tatttgtgca tgtaacattt atcgacatct acccactgca agtatagatg aataagacac 60 

agtcacacca taaaggagtt tatccttaaa aggagtgaaa gacattcaaa aaccaaccgc 120 

aataaaaaag ggtgacataa ttgctaaatg gagtggagga acagtgctta tcaattcctg 180 

attgggccac aatgatatac c 201 



<210> 86 
<211> 301 
<212> DNA
<213> Homo sapiens

<400> 86

tttataaaat attttattta cagtagagct ttacaaaaat agtcttaaat taatacaaat 60 

cccttttgca atataactta tatgactatc ttctcaaaaa cgtgacattc gattataaca 120 

cataaactac atttatagtt gttaagtcac cttgtagtat aaatatgttt tcatcttttt 180 

tctgtaataa ggtE(catacc aataacaatg aacaatggac aacaaatctt attttgntat 240

tcttccaatg taaaattcat ctctggccaa aacaaaacta accaaagasia agtaaaacaa 300

t 301

<210> 87
<211> 351
<212> DNA

<213> Homo sapiens
<400> 87

aaaaaagatt taagatcata aataggtcat tgctgtcaca acacatttca gaatcttaaa 60

aaaacaaaca ttttggcttt ctaagaaaaa gacttttaaa aaaaatcaat tccctcatca 120

ctgaaaggac ttgtacattt ttaaacttcc agtctcctaa ggcacagtat ttaatcagaa 180

tgccaatatt accaccctgc tgtagcanga ataaagaagc aagggattaa cacccaaaaa 240 

aacngccaaa ttcctgaacc aaatcattgg cattttaaaa aagggataaa aaaacnggnt 300 

aaggggggga gcattttaag taaagaangg ccaagggtgg tatgccngga c 351 



<210> 88
<211> 301
<212> DNA
<213> Homo sapiens



<400> 88 

gttttaggtc tttaccaatt tgattggttt atcaacaggg catgaggttt aaatatatct 60 

ttgaggaaag gtaaagtcaa atttgacttc ataggtcatc ggcgtcctca ctcctgtgca 120 

ttttctggtg gaagcacaca gccaattaac tcaagtgtgg cgntagcgat gctttttcat 180 

ggngtcatct atccacttgg tgaactcgca cacttgaatg naaactcctg ggtcattggg 240

ntggccgcaa gggaaaggtc cccaagacac caaaccttgc agggtacctn tgcacaccaa 300 

c 301



<210> 89
<211> 591 
<212> DNA 
<213> Homo sapiens 

<400> 89 

tttttttttc tttttttatt aatcaaatga ttcaaaacaa ccatcattct gtcaatgccc 60 

aagcacccag ctggtcctct ccccacatgt cacactctcc tcagcctctc ccccaaccct 120 

gctctccctc ctcccctgcc ctagcccagg gacagagtct aggaggagcc tggggcagag 180 

ctggaggcag gaagagagca ctggacagac agccatggtt tggattgggg aagagattag 240 
gaagtaggtt cttaaagacc cttttttagt accagatatc cagccatatc cccagctcca 300

ttattcaaat catttcccat agcccagctc ctctctgttc tccccctact accaattctt 360 

tggctctcac acaattttta tccctcaaat attcatccct ggcccaacca gtcccctgag 420 

cctccctctg gtggagactc ctccacccat gagctcccca gagcatccaa gacagagtgc 480 

acagagacct ggggaaggaa gctgaactct gcagagatgt ggacaggtgc aggccagggt 540

acagggtggt ggtagaggag acaagtttca tttccaggcc cacagtctct c 591 



<210> 90 

<211> 1996 

<212> DNA 

<213> Homo sapiens 



<400> 90 

tttttttttt ttttttatca aatgaatact ttattagaga cataacacgt ataaaataaa 60 

tttcttttca tcatggagtt accagatttt aaaaccaacc aacactttct catttttaca 120 

gccaagacac gttaaattct taaatgccat aacttttgtc caactgcttt gtcattcaac 180 

tcacaagtct agaatgtgat taagctacaa atctaagtat tcacagatgt gtcttaggct 240

tggtttgtaa caatctagaa gcaatctgtt tacaaaagtg ccaccaaagc attttaaaga 300 

aaccaattta atgccaccaa acataagcct gctatacctg ggaaacaaaa aatctcacac 360 

ctaaattcta gcagagtaaa cgattccaac tagaatgtac tgtacatcca tatggcacat 420 

ttatgacttt gtaatatgta attcataata caggtttagg tgtgtggtat ggagctagga 480 

aaaccaaagt agtaggatat tatagaaaag atctgatgtt aagtataaag tcatatgcct 540 

gatttcctca aaccttttgt ttttcctcat gtcttctgtc tttatatttt tatcacaaac 600 

caagatctaa cagggttctt tctagaggat tattagataa gtaacacttg atcattaagc 660 

acggatcatg ccacccattc atggttgttc tatgttccat gaactctaat agcccaactt 720

atacatggca ccccaagggg atgcttcagc ca^aaagtaa agggctgaaa aagtagaaca 780

atacaaaagc cctcgtgtgg tgggaactgt ggcctcacto ttacttgtcc ttccattcaa 840 

aacagtttgg cacctttcca tgacgaggat ctctacaggt aggttaaaat acttttctgt 900 

gctattcagc cagaaatagt ttttgtgctg gatatgattt taaaacagat tttgtctgtc 960 

accagtgcaa aaacattaca gatgtctiggg ctaatacaaa aacacataag aatctacaac 1020 

tttatattta ataccctatt caaatttaac tcaaagtaat gcaaaataat tagaagtaaa 1080

aacttaattc ttctgagagc tctatttgga aaagctccac atatccacac acaaatatgg 1140 

gtatattcat gcacagggca aacaactgta ttctgaagca taaataaact caaagcaaga 1200 

catcagtagc tagataccag ttccagtatc ggttaatggc ctctggggat cccattttaa 1260

gcactctcag atgaggatct tgctcagttg ttagactatc attagtttga ttaagcaact 1320 

gaagtttact tcataaatta ctttttccta tatccaggac tccgcctgag aaattttata 1380 

cattcctcca aaggtaagta ttctccaaag gtaagtattt gactattaac acaaaggcaa 1440

tgtgattatt gcataatgac actaaatatt atgtggcttt tctgttaggt ttataagttt 1500 

tcaatgatca gttcaiagaaa atgcagatca tatataacta aggttttaca ccagtggttg 1560 

acaaactatg gcccacaggc taaacccagc ctccccttgt ttttataaat aagttttatt 1620 

agacataacc acactcattc atttctgtat tgtgtatagc tgctttcacg ctatactagc 1680 

agaactgaat agtcgtgaca gagactgtat ggaccgtgaa gcacaaatat ttaccatctg 1740 

gcccattcta aaaaaagtgt gccaattcct ggtttacact aaaatataga gtttagtggg 1800 

aagcctattt gaaatgtgtt tkttttaggg gctgtaatta ccaattaaaa ttaaggttca 1860 

ggtgactcag caaccaaaca aaagggatac taatttttta tgaacaatat atttgtattt 1920 

tatggacata aaaggaaact tccagaaaga aaaggaggaa aataaagggg gaaagggacc 1980 

caacacaatg gaattc 1996 



<210> 91 
<211> 911 
<212> DNA 
<213> Homo sapiens

<400> 91

gccctttttt tttttttttt cttgtttaaa aaaattgttt tcattttaat gatctgagtt 60



agtaacaaac aaatgtacaa aattgtcttt cacatttcca tacatcgtgt tatggaccaa 120 

atgaaaacgc tggactacaa atgcaggtct ccctatatcc tcaacttcaa tcactgtcac 180 

ttataaataa aggtgatttg ctaacacatg catttgtgaa cacagatgcc aaaaattata 240 

catgtaagtt aatgcacaac caagagtata cactgttcat ttgtgcagtc atgcgtcaaa 300

tgcgactgac acagaagcag ttatcctggg atatttcact ctatatgaaa agcatcttgg 360 

agaaatagat tgaaatacag tttaaaacaa aaactgcact ctacaaatac aataaaaccc 420

gcaacttgca catctgaagc aacatttgag aaagctgctt caataaccct gctgttatat 480 

tggttttata ggtatatctc caaagtcatg ggtcgggata tagccgcttt aaagaaaata 540

aatatgtata ttaaaaggaa aatcacactt taaaaatgtg aggaaagcct tgaaaacagt 600 

cttaatgcat gagtccatct acatattttc aagttttgga aacagaaaga agtttagaat 660 

tttcaaagta atctgaaaac tttctaagcc attctaaaat aagatttttt tccccatctt 720 

tccaatgttt cctatttgat agtgtaatac agaaatgggc agtttctagt gtcaacttaa 780 

ccgtgctaat tcataagtca ttacacattt atgacttaag agttcaaaca agtggaaatt 840 

gggttataat gaaaatgaca agggggcccc ttcagcagcc actcatctga actagcaacc 900 
ccaacacaat g 911

<210> 92

<211> 1710 

<212> DNA 

<213> Homo sapiens

<400> 92 

tttttttttt tttttaactt ttagcagtgt ttatttttgt taaaagaaac caattgaatt 60

gaaggtcaag acaccttictg actgcacaga ctaaacaaga aagcattacc tatttcaact 120
ttacaaagca tcttattgat ttaaaaagat ccatactatt gataaagttc accatgaaca 180

tatatgtaat aaggagacta aaatattcat tttacatatc tacaacatgt atttcatatt 240 

tctaatcaac cacaaatcat ataggaaaat atttaggtcc atgaaaaagt ttca^aacat 300

taaaaaacta aagttttgaa acaaatcaca tgtgaaagct cattaaataa taacattgac 360 

aaataaatag ttaatcagct ttacttatta gctgctgcca tgcatttctg gcattccatt 420 

ccaagcgagg gtcagcatgc agggtataat ttcatactat gcgaccgtaa agagctacag 480 

ggcttatttt tgaagtgaaa tgtcacaggg tctttcattc tctttcaaag gaagatcact 540

catggctgct aaactgttcc catgaagagt accaaaaaag cacctttctg aaatgttact 600 

gtgaagattc atgacaacat atttttttta acctgttttg aaggagtttt gtttaggaga 660 

ggggatgggc cagcagatgg agggtatctg agaagccctc tcccgcttta aaatataatg 720

attcactgat gtttatagta tcaacagtct tttaagaaca atgaggaatt aaaactacag 780 

gatacgtgga atttaaatgc aaattgcatt catggatata cctacatctt gaaaaacctg 840 

aaaaggaaaa actattccca aagaaggtcc tgatacctaa gacagcttgc tgggttcgat 900 

caaagcagaa agcatatact ttcaagtgag aaaacagcag tggcaggctt gagtcttcca 960 

agcaatcaaa tctgtaaagc agatggttac tagtaagtct agttatggga gtctgagttc 1020 

taactcatgc tgtgcttgct ggatttgctg gctcttttcc gccctctgtg atgctggact 1080 

ggcttgtcag gtgacatgct ctcaaagttg tgactggact cgttgtgctg ccgggtgtac 1140

ctcttgcact tgcaggcagt gactactgtg attttgtagg tgcgtgtgct gccatcttgg 1200

cactgcagct ggattctctg ggtacgggtt ttgtcattga cacaccgcca ctcctgggag 1260

ctcctcctgc tccagcactt tgttccatag cctcctccaa tccagttagg gagcactggc 1320 

aggggcaagc actcgccagc acacaccagc tccttcagag ggctgatgct ggtgcactgg 1380 

ccatcagaga tgtatttggt ggaacgcagt tcccggcaac ccacttgaac ccgagtgttc 1440 

cgatccagtc cagtgttact gaaatgcctg cccccatttc tggcttgatt caacgtgctg 1500 

ttgctgctgg ggtgtgctgg aacaggttta accacatgtg aacaaaggat ttctgtggca 1560 

ccattttcaa aagccaaaca gcttttcatt aggatgcatg caaggggaag gagatagaaa 1620 

tgaatggcag gaggaagcat ggtgagtaga ggatttgctt gactgaagag ctggttaatt 1680 

cttttgcctc tgcccaacac aacggaattc 1710 



<210> 93
<211> 251
<212> DNA
<213> Homo sapiens

<400> 93

cccaccctac ccaaatatta gacaccaaca cagaaaagct agcaatggat tcccttctac 60
tttgttaaat aaataagtta aatatttaaai tgcctgtgtc tctgtgatgg caacagaagg 120
accaacaggc cacaccctga taaaaggtaa gaggggggcg gatcagcaaa aagacagtgc 180
tgtgggctga ggggacctgg ttcttgcgtg ttgcccctca agactctccc cccacaaata 240
actttcatat g 251



<210> 94

<211> 738

<212> DNA
<213> Homo sapiens 

<400> 94 

cccttttttt ttttttttcc acttctcagt tcatttctgg gactaaattt gggtcagagc 60 

tgcagagaag ggatgggccc tgagcttgag gatgaaagtg ccccagggag attgagacgc 120 

aacccccgcc ctggacagtt ttggaaattg ttcccagggt tcaactagag agacacggtc 180 

agcccaacgt gggggaagca gaccctgagt ccaggagaca tggggtcagg ggctggagag 240

atgaacattc tcaacatctc tgggaaggaa tgagggtctg aaaggagtgt cagggctgtc 300 

cctgcagcag gtggggatgc cggtgtgctg agtcctggga tgactcagga gttggcctgg 360 

acggttcccc ggacccactt ggcgaacctg cagaggttcg tgtagacacc cggtctgttg 420 

ggccgggcac aagggtaatc tccccaggac acgagtccct gcagggagcc attgcagacc 480

acaggccccc cagaatcacc ctggcaggag tctctacctg ctttgtcacc ggcgcagaac 540

atggtgtcaif ctatctgtct cgggtaagca fccbtcgcacc ttttctgact tagcacgctg 600
atattc&agc actggaggac cttagggaag tgcacttggg ggctcttggt tgtcecccag 660

ccagacacca agciactttgt cccagcagag ggacaatgag aggiagacgtt gatgggtctg 720

acatctttag tigggacga 738 
An isolated nucleic acid comprising the nucleotide sequence
of SEQ ID NO: 3, encoding an immunogenic portion of a breast
protein or a variant of said protein, a vector and a host
the polypeptide or antibodies reactive with the polypeptide,
its use for the manufacture of a vaccine, its use for the
detection of breast proteins or nucleic acids.